CVPR 2017: Honolulu, HI, USA
2017 IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2017, Honolulu, HI, USA, July 21-26, 2017. IEEE Computer Society 2017, ISBN 978-1-5386-0457-1
Xiaobo Wang, Xiaojie Guo, Zhen Lei, Changqing Zhang, Stan Z. Li:
Exclusivity-Consistency Regularized Multi-view Subspace Clustering. 1-9
Weifeng Ge, Yizhou Yu:
Borrowing Treasures from the Wealthy: Deep Transfer Learning through Selective Joint Fine-Tuning. 10-19
Kenneth Marino, Ruslan Salakhutdinov, Abhinav Gupta:
The More You Know: Using Knowledge Graphs for Image Classification. 20-28
Martin Simonovsky, Nikos Komodakis:
Dynamic Edge-Conditioned Filters in Convolutional Neural Networks on Graphs. 29-38
Ignacio Rocco, Relja Arandjelovic, Josef Sivic:
Convolutional Neural Network Architecture for Geometric Matching. 39-48
Spyridon Thermos, Georgios Th. Papadopoulos, Petros Daras, Gerasimos Potamianos:
Deep Affordance-Grounded Sensorimotor Object Recognition. 49-57
David Lopez-Paz, Robert Nishihara, Soumith Chintala, Bernhard Schölkopf, Léon Bottou:
Discovering Causal Signals in Images. 58-66
Xiyu Yu, Tongliang Liu, Xinchao Wang, Dacheng Tao:
On Compressing Deep Models by Low Rank and Sparse Decomposition. 67-76
Charles Ruizhongtai Qi, Hao Su, Kaichun Mo, Leonidas J. Guibas:
PointNet: Deep Learning on Point Sets for 3D Classification and Segmentation. 77-85
Seyed-Mohsen Moosavi-Dezfooli, Alhussein Fawzi, Omar Fawzi, Pascal Frossard:
Universal Adversarial Perturbations. 86-94
Konstantinos Bousmalis, Nathan Silberman, David Dohan, Dumitru Erhan, Dilip Krishnan:
Unsupervised Pixel-Level Domain Adaptation with Generative Adversarial Networks. 95-104
Christian Ledig, Lucas Theis, Ferenc Huszar, Jose Caballero, Andrew Cunningham, Alejandro Acosta, Andrew P. Aitken, Alykhan Tejani, Johannes Totz, Zehan Wang, Wenzhe Shi:
Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network. 105-114
Frank Michel, Alexander Kirillov, Eric Brachmann, Alexander Krull, Stefan Gumhold, Bogdan Savchynskyy, Carsten Rother:
Global Hypothesis Generation for 6D Object Pose Estimation. 115-124
Mahdi Abbaspour Tehrani, Thabo Beeler, Anselm Grundhöfer:
A Practical Method for Fully Automatic Intrinsic Camera Calibration Using Directionally Encoded Light. 125-133
Wayne Treible, Philip Saponaro, Scott Sorensen, Abhishek Kolagunda, Michael ONeal, Brian Phelan, Kelly Sherbondy, Chandra Kambhamettu:
CATS: A Color and Thermal Stereo Benchmark. 134-142
Abed Malti, Cédric Herzet:
Elastic Shape-from-Template with Spatially Sparse Deforming Forces. 143-151
Qingan Yan, Long Yang, Ling Zhang, Chunxia Xiao:
Distinguishing the Indistinguishable: Exploring Structural Ambiguities via Geodesic Context. 152-160
Dan Xu, Elisa Ricci, Wanli Ouyang, Xiaogang Wang, Nicu Sebe:
Multi-scale Continuous CRFs as Sequential Deep Networks for Monocular Depth Estimation. 161-169
Michael Schober, Amit Adam, Omer Yair, Shai Mazor, Sebastian Nowozin:
Dynamic Time-of-Flight. 170-179
Dim P. Papadopoulos, Jasper R. R. Uijlings, Frank Keller, Vittorio Ferrari:
Training Object Class Detectors with Click Supervision. 180-189
Shuran Song, Fisher Yu, Andy Zeng, Angel X. Chang, Manolis Savva, Thomas A. Funkhouser:
Semantic Scene Completion from a Single Depth Image. 190-198
Andy Zeng, Shuran Song, Matthias Nießner, Matthew Fisher, Jianxiong Xiao, Thomas A. Funkhouser:
3DMatch: Learning Local Geometric Descriptors from RGB-D Reconstructions. 199-208
Shubham Tulsiani, Tinghui Zhou, Alexei A. Efros, Jitendra Malik:
Multi-view Supervision for Single-View Reconstruction via Differentiable Ray Consistency. 209-217
Tommaso Cavallari, Stuart Golodetz, Nicholas A. Lord, Julien P. C. Valentin, Luigi di Stefano, Philip H. S. Torr:
On-the-Fly Adaptation of Regression Forests for Online Camera Relocalisation. 218-227
Yagiz Aksoy, Tunç Ozan Aydin, Marc Pollefeys:
Designing Effective Inter-Pixel Information Flow for Natural Image Matting. 228-236
Shuochen Su, Mauricio Delbracio, Jue Wang, Guillermo Sapiro, Wolfgang Heidrich, Oliver Wang:
Deep Video Deblurring for Hand-Held Cameras. 237-246
Seungjun Nah, Tae Hyun Kim, Kyoung Mu Lee:
Deep Multi-scale Convolutional Neural Network for Dynamic Scene Deblurring. 257-265
Yijun Li, Chen Fang, Jimei Yang, Zhaowen Wang, Xin Lu, Ming-Hsuan Yang:
Diversified Texture Synthesis with Feed-Forward Networks. 266-274
Zhipeng Mo, Boxin Shi, Sai-Kit Yeung, Yasuyuki Matsushita:
Radiometric Calibration for Internet Photo Collections. 275-283
Youngjung Kim, Hyungjoo Jung, Dongbo Min, Kwanghoon Sohn:
Deeply Aggregated Alternating Minimization for Image Restoration. 284-292
Wei Ke, Jie Chen, Jianbin Jiao, Guoying Zhao, Qixiang Ye:
SRN: Side-Output Residual Network for Object Symmetry Detection in the Wild. 302-310
Mihoko Shimano, Hiroki Okawa, Yuta Asano, Ryoma Bise, Ko Nishino, Imari Sato:
Wetness and Color from a Single Multispectral Image. 321-329
Yuanming Hu, Baoyuan Wang, Stephen Lin:
FC^4: Fully Convolutional Color Constancy with Confidence-Weighted Pooling. 330-339
George Trigeorgis, Patrick Snape, Iasonas Kokkinos, Stefanos Zafeiriou:
Face Normals "In-the-Wild" Using Fully Convolutional Networks. 340-349
Yvain Quéau, Tao Wu, François Lauze, Jean-Denis Durou, Daniel Cremers:
A Non-convex Variational Approach to Photometric Stereo under Inaccurate Lighting. 350-359
Kosuke Takahashi, Akihiro Miyata, Shohei Nobuhara, Takashi Matsuyama:
A Linear Extrinsic Calibration of Kaleidoscopic Imaging System from Single 3D Point. 360-368
Huu Le, Tat-Jun Chin, David Suter:
An Exact Penalty Method for Locally Convergent Maximum Consensus. 379-387
Chi Li, M. Zeeshan Zia, Quoc-Huy Tran, Xiang Yu, Gregory D. Hager, Manmohan Chandraker:
Deep Supervision with Shape Concepts for Occlusion-Aware 3D Object Parsing. 388-397
Zhuo Deng, Longin Jan Latecki:
Amodal Detection of 3D Objects: Inferring 3D Bounding Boxes from 2D Ones in RGB-Depth Images. 398-406
Guillermo Garcia-Hernando, Tae-Kyun Kim:
Transition Forests: Learning Discriminative Temporal Transitions for Action Recognition and Detection. 407-415
Pichao Wang, Wanqing Li, Zhimin Gao, Yuyao Zhang, Chang Tang, Philip Ogunbona:
Scene Flow to Action Map: A New Representation for RGB-D Based Action Recognition with Convolutional Neural Networks. 416-425
Qianru Sun, Bernt Schiele, Mario Fritz:
A Domain Based Approach to Social Relation Recognition. 435-444
Junwu Weng, Chaoqun Weng, Junsong Yuan:
Spatio-Temporal Naive-Bayes Nearest-Neighbor (ST-NBNN) for Skeleton-Based Action Recognition. 445-454
Ajjen Joshi, Soumya Ghosh, Margrit Betke, Stan Sclaroff, Hanspeter Pfister:
Personalizing Gesture Recognition Using Hierarchical Bayesian Neural Networks. 455-464
Wadim Kehl, Federico Tombari, Slobodan Ilic, Nassir Navab:
Real-Time 3D Model Tracking in Color and Depth on a Single CPU Core. 465-473
Dafang He, Xiao Yang, Chen Liang, Zihan Zhou, Alexander G. Ororbia II, Daniel Kifer, C. Lee Giles:
Multi-scale FCN with Cascaded Instance Aware Segmentation for Arbitrary Oriented Word Spotting in the Wild. 474-483
Xavier Alameda-Pineda, Andrea Pilzer, Dan Xu, Nicu Sebe, Elisa Ricci:
Viraliency: Pooling Local Virality. 484-492
Lei Zhu, Chi-Wing Fu, Michael S. Brown, Pheng-Ann Heng:
A Non-local Low-Rank Framework for Ultrasound Speckle Reduction. 493-501
Donghun Yeo, Jeany Son, Bohyung Han, Joon Hee Han:
Superpixel-Based Tracking-by-Segmentation Using Markov Chains. 511-520
Bohyung Han, Jack Sim, Hartwig Adam:
BranchOut: Regularization for Online Ensemble Tracking with Convolutional Neural Networks. 521-530


Subeesh Vasu, A. N. Rajagopalan:
From Local to Global: Edge Profiles to Camera Motion in Blurred Images. 558-567
Derya Akkaynak, Tali Treibitz, Tom Shlesinger, Yossi Loya, Raz Tamir, David Iluz:
What is the Space of Attenuation Coefficients in Underwater Computer Vision? 568-577
Zhengqin Li, Zexiang Xu, Ravi Ramamoorthi, Manmohan Chandraker:
Robust Energy Minimization for BRDF-Invariant Shape from Light Fields. 578-586
S. Alireza Golestaneh, Lina J. Karam:
Spatially-Varying Blur Detection Based on Multiscale Fused and Sorted Transform Coefficients of Gradient Magnitudes. 596-605
Yandong Guo, Cheng Lu, Jan P. Allebach, Charles A. Bouman:
Model-Based Iterative Restoration for Binary Document Image Compression with Dictionary Learning. 606-615
Seungryong Kim, Dongbo Min, Bumsub Ham, Sangryul Jeon, Stephen Lin, Kwanghoon Sohn:
FCSS: Fully Convolutional Self-Similarity for Dense Semantic Correspondence. 616-625
Philip Häusser, Alexander Mordvintsev, Daniel Cremers:
Learning by Association - A Versatile Semi-Supervised Training Method for Neural Networks. 626-635
Richard Zhang, Phillip Isola, Alexei A. Efros:
Split-Brain Autoencoders: Unsupervised Learning by Cross-Channel Prediction. 645-654
Mariano Tepper, Guillermo Sapiro:
Nonnegative Matrix Underapproximation for Robust Multiple Model Fitting. 655-663

Chong Peng, Zhao Kang, Qiang Cheng:
Subspace Clustering via Variance Regularized Ridge Regression. 682-691
Vamsi K. Ithapu, Risi Kondor, Sterling C. Johnson, Vikas Singh:
The Incremental Multiresolution Matrix Factorization Algorithm. 692-701
Eunbyung Park, Jimei Yang, Ersin Yumer, Duygu Ceylan, Alexander C. Berg:
Transformation-Grounded Image Generation Network for Novel 3D View Synthesis. 702-711
Shuhang Gu, Wangmeng Zuo, Shi Guo, Yunjin Chen, Chongyu Chen, Lei Zhang:
Learning Dynamic Guidance for Depth Image Enhancement. 712-721
Shuang Ma, Jing Liu, Chang Wen Chen:
A-Lamp: Adaptive Layout-Aware Multi-patch Deep Convolutional Neural Network for Photo Aesthetic Assessment. 722-731
Austin Stone, Hua-Yan Wang, Michael Stark, Yi Liu, D. Scott Phoenix, Dileep George:
Teaching Compositionality to CNNs. 732-741
Shixing Chen, Caojin Zhang, Ming Dong, Jialiang Le, Mike Rao:
Using Ranking-CNN for Age Estimation. 742-751
Jimmy S. J. Ren, Xiaohao Chen, Jianbo Liu, Wenxiu Sun, Jiahao Pang, Qiong Yan, Yu-Wing Tai, Li Xu:
Accurate Single Stage Detector Using Recurrent Rolling Convolution. 752-760
Chunpeng Wu, Wei Wen, Tariq Afzal, Yongmei Zhang, Yiran Chen, Hai Li:
A Compact DNN: Approaching GoogLeNet-Level Accuracy of Classification and Domain Adaptation. 761-770
Jawadul H. Bappy, Sujoy Paul, Ertem Tuncel, Amit K. Roy-Chowdhury:
The Impact of Typicality for Informative Representative Selection. 771-780
M. Ehsan Abbasnejad, Anthony R. Dick, Anton van den Hengel:
Infinite Variational Autoencoder for Semi-Supervised Learning. 781-790
Ayan Sinha, Asim Unmesh, Qixing Huang, Karthik Ramani:
SurfNet: Generating 3D Shape Surfaces Using Deep Residual Networks. 791-800
Rudrasis Chakraborty, Søren Hauberg, Baba C. Vemuri:
Intrinsic Grassmann Averages for Online Linear and Robust Subspace Learning. 801-809
Manuel HauBmann, Fred A. Hamprecht, Melih Kandemir:
Variational Bayesian Multiple Instance Learning with Gaussian Processes. 810-819
Wenjie Pei, Tadas Baltrusaitis, David M. J. Tax, Louis-Philippe Morency:
Temporal Attention-Gated Model for Robust Sequence Classification. 820-829
Sujoy Paul, Jawadul H. Bappy, Amit K. Roy-Chowdhury:
Non-uniform Subset Selection for Active Learning in Structured Data. 830-839
Gustav Larsson, Michael Maire, Gregory Shakhnarovich:
Colorization as a Proxy Task for Visual Understanding. 840-849
Hessam Bagherinezhad, Mohammad Rastegari, Ali Farhadi:
LCNN: Lookup-Based Convolutional Neural Network. 860-869
Hao Zhao, Ming Lu, Anbang Yao, Yiwen Guo, Yurong Chen, Li Zhang:
Physics Inspired Optimization on Semantic Transfer Features: An Alternative Method for Room Layout Estimation. 870-878
Anurag Arnab, Philip H. S. Torr:
Pixelwise Instance Segmentation with a Dynamically Instantiated Network. 879-888
Kai Kang, Hongsheng Li, Tong Xiao, Wanli Ouyang, Junjie Yan, Xihui Liu, Xiaogang Wang:
Object Detection in Videos with Tubelet Proposal Networks. 889-897
Cheng Da, Shibiao Xu, Kun Ding, Gaofeng Meng, Shiming Xiang, Chunhong Pan:
AMVH: Asymmetric Multi-Valued hashing. 898-906
Haiyu Zhao, Maoqing Tian, Shuyang Sun, Jing Shao, Junjie Yan, Shuai Yi, Xiaogang Wang, Xiaoou Tang:
Spindle Net: Person Re-identification with Human Body Region Guided Feature Decomposition and Fusion. 907-915
Yue Cao, Mingsheng Long, Jianmin Wang, Shichen Liu:
Deep Visual-Semantic Quantization for Efficient Image Retrieval. 916-925
Ahmet Iscen, Giorgos Tolias, Yannis S. Avrithis, Teddy Furon, Ondrej Chum:
Efficient Diffusion on Region Manifolds: Recovering Small Objects with Compact CNN Representations. 926-935
Tsung-Yi Lin, Piotr Dollár, Ross B. Girshick, Kaiming He, Bharath Hariharan, Serge J. Belongie:
Feature Pyramid Networks for Object Detection. 936-944
Hongliang Yan, Yukang Ding, Peihua Li, Qilong Wang, Yong Xu, Wangmeng Zuo:
Mind the Class Weight Bias: Weighted Maximum Mean Discrepancy for Unsupervised Domain Adaptation. 945-954
Chuang Gan, Zhe Gan, Xiaodong He, Jianfeng Gao, Li Deng:
StyleNet: Generating Attractive Visual Captions with Styles. 955-964
Leonid Karlinsky, Joseph Shtok, Yochay Tzur, Asaf Tzadok:
Fine-Grained Recognition of Thousands of Object Categories with Single-Example Training. 965-974
Yinpeng Dong, Hang Su, Jun Zhu, Bo Zhang:
Improving Interpretability of Deep Neural Networks with Semantic Information. 975-983
Yingwei Pan, Ting Yao, Houqiang Li, Tao Mei:
Video Captioning with Transferred Semantic Attributes. 984-992
Arthur Daniel Costea, Robert Varga, Sergiu Nedevschi:
Fast Boosting Based Detection Using Scale Invariant Multimodal Multiresolution Filtered Features. 993-1002
Colin Lea, Michael D. Flynn, René Vidal, Austin Reiter, Gregory D. Hager:
Temporal Convolutional Networks for Action Segmentation and Detection. 1003-1012
Si Liu, Changhu Wang, Ruihe Qian, Han Yu, Renda Bao, Yao Sun:
Surveillance Video Parsing with Single Frame Supervision. 1013-1021
Yan Yan, Chenliang Xu, Dawen Cai, Jason J. Corso:
Weakly Supervised Actor-Action Segmentation via Robust Multi-task Ranking. 1022-1031
De-An Huang, Joseph J. Lim, Li Fei-Fei, Juan Carlos Niebles:
Unsupervised Visual-Linguistic Reference Resolution in Instructional Videos. 1032-1041
Jie Qin, Li Liu, Ling Shao, Fumin Shen, Bingbing Ni, Jiaxin Chen, Yunhong Wang:
Zero-Shot Action Recognition with Error-Correcting Output Codes. 1042-1051
Bryan A. Plummer, Matthew Brown, Svetlana Lazebnik:
Enhancing Video Summarization via Vision-Language Embedding. 1052-1060
Jianwen Xie, Song-Chun Zhu, Ying Nian Wu:
Synthesizing Dynamic Patterns by Spatial-Temporal Generative ConvNet. 1061-1069
Ramakrishna Vedantam, Samy Bengio, Kevin Murphy, Devi Parikh, Gal Chechik:
Context-Aware Captions from Context-Agnostic Supervision. 1070-1079
Abhishek Das, Satwik Kottur, Khushi Gupta, Avi Singh, Deshraj Yadav, José M. F. Moura, Devi Parikh, Dhruv Batra:
Visual Dialog. 1080-1089
Yuting Zhang, Luyao Yuan, Yijie Guo, Zhiyuan He, I-An Huang, Honglak Lee:
Discriminative Bimodal Networks for Visual Localization and Detection with Natural Language Queries. 1090-1099
Zaeem Hussain, Mingda Zhang, Xiaozhong Zhang, Keren Ye, Christopher Thomas, Zuha Agha, Nathan Ong, Adriana Kovashka:
Automatic Understanding of Image and Video Advertisements. 1100-1110
Kai Chen, Hang Song, Chen Change Loy, Dahua Lin:
Discover and Learn New Objects from Documentaries. 1111-1120
Long Mai, Hailin Jin, Zhe L. Lin, Chen Fang, Jonathan Brandt, Feng Liu:
Spatial-Semantic Image Search by Visual Feature Synthesis. 1121-1130
Yongxi Lu, Abhishek Kumar, Shuangfei Zhai, Yu Cheng, Tara Javidi, Rogério Schmidt Feris:
Fully-Adaptive Feature Sharing in Multi-Task Networks with Applications in Person Attribute Classification. 1131-1140
Zhe Gan, Chuang Gan, Xiaodong He, Yunchen Pu, Kenneth Tran, Jianfeng Gao, Lawrence Carin, Li Deng:
Semantic Compositional Networks for Visual Captioning. 1141-1150
Zhou Ren, Xiaoyu Wang, Ning Zhang, Xutao Lv, Li-Jia Li:
Deep Reinforcement Learning-Based Image Captioning with Embedding Reward. 1151-1159
Ishan Misra, Abhinav Gupta, Martial Hebert:
From Red Wine to Red Tomato: Composition with Context. 1160-1169
Subhashini Venugopalan, Lisa Anne Hendricks, Marcus Rohrbach, Raymond J. Mooney, Trevor Darrell, Kate Saenko:
Captioning Images with Diverse Objects. 1170-1178
Steven J. Rennie, Etienne Marcheret, Youssef Mroueh, Jarret Ross, Vaibhava Goel:
Self-Critical Sequence Training for Image Captioning. 1179-1195
Chengde Wan, Thomas Probst, Luc Van Gool, Angela Yao:
Crossing Nets: Combining GANs and VAEs with a Shared Latent Space for Hand Pose Estimation. 1196-1205
Shan Su, Jung Pyo Hong, Jianbo Shi, Hyun Soo Park:
Predicting Behaviors of Basketball Players from First Person Videos. 1206-1215
Grégory Rogez, Philippe Weinzaepfel, Cordelia Schmid:
LCR-Net: Localization-Classification-Regression for Human Pose. 1216-1224
Jin Sun, David W. Jacobs:
Seeing What is Not There: Learning Context to Determine Where Objects are Missing. 1234-1242
Zhiwu Huang, Chengde Wan, Thomas Probst, Luc Van Gool:
Deep Learning on Lie Groups for Skeleton-Based Action Recognition. 1243-1252
Georgios Pavlakos, Xiaowei Zhou, Konstantinos G. Derpanis, Kostas Daniilidis:
Harvesting Multiple Views for Marker-Less 3D Human Pose Annotations. 1253-1262
Georgios Pavlakos, Xiaowei Zhou, Konstantinos G. Derpanis, Kostas Daniilidis:
Coarse-to-Fine Volumetric Prediction for Single-Image 3D Human Pose. 1263-1272
Alexander Richard, Hilde Kuehne, Juergen Gall:
Weakly Supervised Action Learning with RNN Based Fine-to-Coarse Modeling. 1273-1282
Luan Tran, Xi Yin, Xiaoming Liu:
Disentangled Representation Learning GAN for Pose-Invariant Face Recognition. 1283-1292
Eldar Insafutdinov, Mykhaylo Andriluka, Leonid Pishchulin, Siyu Tang, Evgeny Levinkov, Bjoern Andres, Bernt Schiele:
ArtTrack: Articulated Multi-Person Tracking in the Wild. 1293-1301
Zhe Cao, Tomas Simon, Shih-En Wei, Yaser Sheikh:
Realtime Multi-person 2D Pose Estimation Using Part Affinity Fields. 1302-1310
Itamar Talmi, Roey Mechrez, Lihi Zelnik-Manor:
Template Matching with Deformable Diversity Similarity. 1311-1319
Weihua Chen, Xiaotang Chen, Jianguo Zhang, Kaiqi Huang:
Beyond Triplet Loss: A Deep Quadruplet Network for Person Re-identification. 1320-1329
Kuo-Hao Zeng, Shih-Han Chou, Fu-Hsiang Chan, Juan Carlos Niebles, Min Sun:
Agent-Centric Risk Assessment: Accident Anticipation and Risky Region Localization. 1330-1338
Linchao Zhu, Zhongwen Xu, Yi Yang:
Bidirectional Multirate Reconstruction for Temporal Modeling in Videos. 1339-1348
Sangdoo Yun, Jongwon Choi, Youngjoon Yoo, Kimin Yun, Jin Young Choi:
Action-Decision Networks for Visual Tracking with Deep Reinforcement Learning. 1349-1358
Yunseok Jang, Yale Song, Youngjae Yu, Youngjin Kim, Gunhee Kim:
TGIF-QA: Toward Spatio-Temporal Reasoning in Visual Question Answering. 1359-1367
Yu-Chuan Su, Kristen Grauman:
Making 360° Video Watchable in 2D: Learning Videography for Click Free Viewing. 1368-1376
Rameswar Panda, Amran Bhuiyan, Vittorio Murino, Amit K. Roy-Chowdhury:
Unsupervised Adaptive Re-identification in Open World Dynamic Camera Networks. 1377-1386
Hou-Ning Hu, Yen-Chen Lin, Ming-Yu Liu, Hsien-Tzu Cheng, Yung-Ju Chang, Min Sun:
Deep 360 Pilot: Learning a Deep Agent for Piloting through 360° Sports Videos. 1396-1405
Joel Janai, Fatma Güney, Jonas Wulff, Michael J. Black, Andreas Geiger:
Slow Flow: Exploiting High-Speed Cameras for Accurate and Diverse Optical Flow Reference Data. 1406-1416
Zheng Shou, Jonathan Chan, Alireza Zareian, Kazuyuki Miyazawa, Shih-Fu Chang:
CDC: Convolutional-De-Convolutional Networks for Precise Temporal Action Localization in Untrimmed Videos. 1417-1426
Erik Wijmans, Yasutaka Furukawa:
Exploiting 2D Floorplan for Building-Scale Panorama RGBD Alignment. 1427-1435
Florian Bernard, Frank R. Schmidt, Johan Thunberg, Daniel Cremers:
A Combinatorial Solution to Non-Rigid 3D Shape-to-Image Matching. 1436-1445
Geoffrey Pascoe, Will Maddern, Michael Tanner, Pedro Pinies, Paul Newman:
NID-SLAM: Robust Monocular SLAM Using Normalised Information Distance. 1446-1455
Patrick Knöbelreiter, Christian Reinbacher, Alexander Shekhovtsov, Thomas Pock:
End-to-End Training of Hybrid CNN-CRF Models for Stereo. 1456-1465
Shubham Tulsiani, Hao Su, Leonidas J. Guibas, Alexei A. Efros, Jitendra Malik:
Learning Shape Abstractions by Assembling Volumetric Primitives. 1466-1474
Yanhua Cheng, Rui Cai, Zhiwei Li, Xin Zhao, Kaiqi Huang:
Locality-Sensitive Deconvolution Networks with Gated Fusion for RGB-D Indoor Semantic Segmentation. 1475-1483
Jaewon Kim, Ilya Reshetouski, Abhijeet Ghosh:
Acquiring Axially-Symmetric Transparent Objects Using Single-View Transmission Imaging. 1484-1492
Anh Tuan Tran, Tal Hassner, Iacopo Masi, Gérard G. Medioni:
Regressing Robust and Discriminative 3D Morphable Models with a Very Deep Neural Network. 1493-1502
Pengfei Dou, Shishir K. Shah, Ioannis A. Kakadiaris:
End-to-End 3D Face Reconstruction with Deep Neural Networks. 1503-1512
Antonio Agudo, Francesc Moreno-Noguer:
DUST: Dual Union of Spatio-Temporal Subspaces for Monocular Multiple Object 3D Reconstruction. 1513-1521
Jinwei Gu, Xiaodong Yang, Shalini De Mello, Jan Kautz:
Dynamic Facial Analysis: From Bayesian Filtering to Recurrent Neural Network. 1531-1540
Munawar Hayat, Salman Hameed Khan, Naoufel Werghi, Roland Goecke:
Joint Registration and Representation Learning for Unconstrained Face Identification. 1551-1560
Francesc Moreno-Noguer:
3D Human Pose Estimation from a Single Image via Distance Matrix Regression. 1561-1570
Anoop Cherian, Basura Fernando, Mehrtash Harandi, Stephen Gould:
Generalized Rank Pooling for Activity Recognition. 1581-1590
Judith Bütepage, Michael J. Black, Danica Kragic, Hedvig Kjellström:
Deep Representation Learning for Human Motion Prediction and Classification. 1591-1599
Maheen Rashid, Xiuye Gu, Yong Jae Lee:
Interspecies Knowledge Transfer for Facial Keypoint Detection. 1600-1609
Runpeng Cui, Hu Liu, Changshui Zhang:
Recurrent Convolutional Neural Networks for Continuous Sign Language Recognition by Staged Optimization. 1610-1618
Hasan F. M. Zaki, Faisal Shafait, Ajmal S. Mian:
Modeling Sub-Event Dynamics in First-Person Action Recognition. 1619-1628
Gaochang Wu, Mandan Zhao, Liangyong Wang, Qionghai Dai, Tianyou Chai, Yebin Liu:
Light Field Reconstruction Using Deep Convolutional Network on EPI. 1638-1646
Eddy Ilg, Nikolaus Mayer, Tonmoy Saikia, Margret Keuper, Alexey Dosovitskiy, Thomas Brox:
FlowNet 2.0: Evolution of Optical Flow Estimation with Deep Networks. 1647-1655
Qingxing Cao, Liang Lin, Yukai Shi, Xiaodan Liang, Guanbin Li:
Attention-Aware Face Hallucination via Deep Reinforcement Learning. 1656-1664
Anna Khoreva, Rodrigo Benenson, Jan Hendrik Hosang, Matthias Hein, Bernt Schiele:
Simple Does It: Weakly Supervised Instance and Semantic Segmentation. 1665-1674
Tushar Sandhan, Jin Young Choi:
Anti-Glare: Tightly Constrained Optimization for Eyeglass Reflection Removal. 1675-1684
Wenhan Yang, Robby T. Tan, Jiashi Feng, Jiaying Liu, Zongming Guo, Shuicheng Yan:
Deep Joint Rain Detection and Removal from a Single Image. 1685-1694
Chen Li, Stephen Lin, Kun Zhou, Katsushi Ikeuchi:
Radiometric Calibration from Faces in Images. 1695-1704
Bin Jin, Maria V. Ortiz Segovia, Sabine Süsstrunk:
Webly Supervised Semantic Segmentation. 1705-1714
Xueyang Fu, Jiabin Huang, Delu Zeng, Yue Huang, Xinghao Ding, John Paisley:
Removing Rain from Single Images via a Deep Detail Network. 1715-1723
Naeemullah Khan, Byung-Woo Hong, Anthony J. Yezzi, Ganesh Sundaramoorthi:
Coarse-to-Fine Segmentation with Shape-Tailored Continuum Scale Spaces. 1733-1742
Chao Peng, Xiangyu Zhang, Gang Yu, Guiming Luo, Jian Sun:
Large Kernel Matters - Improve Semantic Segmentation by Global Convolutional Network. 1743-1751
Nikolaos Arvanitopoulos, Radhakrishna Achanta, Sabine Süsstrunk:
Single Image Reflection Suppression. 1752-1760
Zhiding Yu, Chen Feng, Ming-Yu Liu, Srikumar Ramalingam:
CASENet: Deep Category-Aware Semantic Edge Detection. 1761-1770
Thomas Nestmeyer, Peter V. Gehler:
Reflectance Adaptive Filtering Improves Intrinsic Image Estimation. 1771-1780
Michael Figurnov, Maxwell D. Collins, Yukun Zhu, Li Zhang, Jonathan Huang, Dmitry P. Vetrov, Ruslan Salakhutdinov:
Spatially Adaptive Computation Time for Residual Networks. 1790-1799
Amir Roshan Zamir, Te-Lin Wu, Lin Sun, William B. Shen, Bertram E. Shi, Jitendra Malik, Silvio Savarese:
Feedback Networks. 1808-1817
Ehsan Elhamifar, M. Clara De Paolis Kaluza:
Online Summarization via Submodular and Convex Optimization. 1818-1826
Florian Chabot, Mohamed Chaouch, Jaonary Rabarisoa, Céline Teulière, Thierry Chateau:
Deep MANTA: A Coarse-to-Fine Many-Task Network for Joint 2D and 3D Vehicle Analysis from Monocular Image. 1827-1836
Yuncheng Li, Yale Song, Jiebo Luo:
Improving Pairwise Ranking for Multi-label Image Classification. 1837-1845
Yunho Jeon, Junmo Kim:
Active Convolution: Learning the Shape of Convolution for Image Classification. 1846-1854
Xun Huang, Yixuan Li, Omid Poursaeed, John E. Hopcroft, Serge J. Belongie:
Stacked Generative Adversarial Networks. 1866-1875
Can Chen, Scott McCloskey, Jingyi Yu:
Image Splicing Detection via Camera Response Function Analysis. 1876-1885
Xuanyi Dong, Junshi Huang, Yi Yang, Shuicheng Yan:
More is Less: A More Complicated Network with Less Inference Complexity. 1895-1903
Evgeny Levinkov, Jonas Uhrig, Siyu Tang, Mohamed Omran, Eldar Insafutdinov, Alexander Kirillov, Carsten Rother, Thomas Brox, Bernt Schiele, Bjoern Andres:
Joint Graph Decomposition & Node Labeling: Problem, Algorithms, Applications. 1904-1912
Zekun Hao, Yu Liu, Hongwei Qin, Junjie Yan, Xiu Li, Xiaolin Hu:
Scale-Aware Face Detection. 1913-1922
Miguel Ángel Bautista, Artsiom Sanakoyeu, Björn Ommer:
Deep Unsupervised Similarity Learning Using Partially Ordered Sets. 1923-1932
Jianwen Xie, Yifei Xu, Erik Nijkamp, Ying Nian Wu, Song-Chun Zhu:
Generative Hierarchical Learning of Sparse FRAME Models. 1933-1941
Ang Li, Jin Sun, Joe Yue-Hei Ng, Ruichi Yu, Vlad I. Morariu, Larry S. Davis:
Generating Holistic 3D Scene Abstractions for Text-Based Image Retrieval. 1942-1950
Jianan Li, Xiaodan Liang, Yunchao Wei, Tingfa Xu, Jiashi Feng, Shuicheng Yan:
Perceptual Generative Adversarial Networks for Small Object Detection. 1951-1959
Ronak Kosti, Jose M. Alvarez, Adrià Recasens, Àgata Lapedriza:
Emotion Recognition in Context. 1960-1968
Jongyoo Kim, Sanghoon Lee:
Deep Learning of Human Visual Sensitivity in Image Quality Assessment Framework. 1969-1977
Linjie Yang, Kevin D. Tang, Jianchao Yang, Li-Jia Li:
Dense Captioning with Joint Inference and Visual Context. 1978-1987
Justin Johnson, Bharath Hariharan, Laurens van der Maaten, Li Fei-Fei, C. Lawrence Zitnick, Ross B. Girshick:
CLEVR: A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning. 1988-1997
Yicong Tian, Chen Chen, Mubarak Shah:
Cross-View Image Matching for Geo-Localization in Urban Environments. 1998-2006
Xing Xu, Fumin Shen, Yang Yang, Dongxiang Zhang, Heng Tao Shen, Jingkuan Song:
Matrix Tri-Factorization with Manifold Regularizations for Zero-Shot Learning. 2007-2016
Lluis Gomez-Bigorda, Yash Patel, Marçal Rusiñol, Dimosthenis Karatzas, C. V. Jawahar:
Self-Supervised Learning of Visual Features through Embedding Images into Text Topic Spaces. 2017-2026
Feng Zhu, Hongsheng Li, Wanli Ouyang, Nenghai Yu, Xiaogang Wang:
Learning Spatial Regularization with Image-Level Supervisions for Multi-label Image Classification. 2027-2036
Pedro Morgado, Nuno Vasconcelos:
Semantically Consistent Regularization for Zero-Shot Recognition. 2037-2046
Bin Wang, Yongsheng Gao, Changming Sun, Michael Blumenstein, John La Salle:
Can Walking and Measuring Along Chord Bunches Better Describe Leaf Shapes? 2047-2056
Qixiang Ye, Tianliang Zhang, Wei Ke, Qiang Qiu, Jie Chen, Guillermo Sapiro, Baochang Zhang:
Self-Learning Scene-Specific Pedestrian Detectors Using a Progressive Latent Model. 2057-2066
Achal Dave, Olga Russakovsky, Deva Ramanan:
Predictive-Corrective Networks for Action Detection. 2067-2076
Behrooz Mahasseni, Sinisa Todorovic, Alan Fern:
Budget-Aware Deep Semantic Video Segmentation. 2077-2086
Noureldien Hussein, Efstratios Gavves, Arnold W. M. Smeulders:
Unified Embedding and Metric Learning for Zero-Exemplar Event Detection. 2087-2096
Yunbo Wang, Mingsheng Long, Jianmin Wang, Philip S. Yu:
Spatiotemporal Pyramid Network for Video Action Recognition. 2097-2106
Zhanning Gao, Gang Hua, Dongqing Zhang, Nebojsa Jojic, Le Wang, Jianru Xue, Nanning Zheng:
ER3: A Unified Framework for Event Retrieval, Recognition and Recounting. 2107-2116
Suyog Dutt Jain, Bo Xiong, Kristen Grauman:
FusionSeg: Learning to Combine Motion and Appearance for Fully Automatic Segmentation of Generic Objects in Videos. 2117-2126
Aidean Sharghi, Jacob S. Laurel, Boqing Gong:
Query-Focused Video Summarization: Dataset, Evaluation, and a Memory Network Based Approach. 2127-2136
Chaochao Lu, Michael Hirsch, Bernhard Schölkopf:
Flexible Spatio-Temporal Networks for Video Prediction. 2137-2145
Konstantinos E. Papoutsakis, Costas Panagiotakis, Antonis A. Argyros:
Temporal Action Co-Segmentation in 3D Motion Capture Data and Videos. 2146-2155
Hyeonseob Nam, Jung-Woo Ha, Jeonghee Kim:
Dual Attention Networks for Multimodal Reasoning and Matching. 2156-2164
Namhoon Lee, Wongun Choi, Paul Vernaza, Christopher B. Choy, Philip H. S. Torr, Manmohan Chandraker:
DESIRE: Distant Future Prediction in Dynamic Scenes with Interacting Agents. 2165-2174
Xiaodan Liang, Liang Lin, Xiaohui Shen, Jiashi Feng, Shuicheng Yan, Eric P. Xing:
Interpretable Structure-Evolving LSTM. 2175-2184
Shireen Y. Elhabian, Ross T. Whitaker:
ShapeOdds: Variational Bayesian Learning of Generative Shape Models. 2185-2196
Haichen Shen, Seungyeop Han, Matthai Philipose, Arvind Krishnamurthy:
Fast Video Classification via Adaptive Cascading of Deep Models. 2197-2205
Hyun Oh Song, Stefanie Jegelka, Vivek Rathod, Kevin Murphy:
Deep Metric Learning via Facility Location. 2206-2214
Yevhen Kuznietsov, Jörg Stückler, Bastian Leibe:
Semi-Supervised Deep Learning for Monocular Depth Map Prediction. 2215-2223
Seunghoon Hong, Donghun Yeo, Suha Kwak, Honglak Lee, Bohyung Han:
Weakly Supervised Semantic Segmentation Using Web-Crawled Videos. 2224-2232
Giorgio Patrini, Alessandro Rozza, Aditya Krishna Menon, Richard Nock, Lizhen Qu:
Making Deep Neural Networks Robust to Label Noise: A Loss Correction Approach. 2233-2241
Ashish Shrivastava, Tomas Pfister, Oncel Tuzel, Joshua Susskind, Wenda Wang, Russell Webb:
Learning from Simulated and Unsupervised Images through Adversarial Training. 2242-2251
Gao Huang, Zhuang Liu, Laurens van der Maaten, Kilian Q. Weinberger:
Densely Connected Convolutional Networks. 2261-2269
Hexiang Hu, Shiyi Lan, Yuning Jiang, Zhimin Cao, Fei Sha:
FastMask: Segment Multi-scale Object Candidates in One Shot. 2280-2288
Matthew O'Toole, Felix Heide, David B. Lindell, Kai Zang, Steven Diamond, Gordon Wetzstein:
Reconstructing Transient Images from Single-Photon Sensors. 2289-2297
Li Liu, Fumin Shen, Yuming Shen, Xianglong Liu, Ling Shao:
Deep Sketch Hashing: Fast Free-Hand Sketch-Based Image Retrieval. 2298-2307
Liangqiong Qu, Jiandong Tian, Shengfeng He, Yandong Tang, Rynson W. H. Lau:
DeshadowNet: A Multi-context Embedding Deep Network for Shadow Removal. 2308-2316
Ryusuke Sagawa, Yutaka Satoh:
Illuminant-Camera Communication to Observe Moving Objects under Strong External Light by Spread Spectrum Modulation. 2317-2325
Shunsuke Saito, Lingyu Wei, Liwen Hu, Koki Nagano, Hao Li:
Photorealistic Facial Texture Inference Using Deep Neural Networks. 2326-2335
Chia-Yin Tsai, Kiriakos N. Kutulakos, Srinivasa G. Narasimhan, Aswin C. Sankaranarayanan:
The Geometry of First-Returning Photons for Non-Line-of-Sight Imaging. 2336-2344
Vijay Rengarajan, Yogesh Balaji, A. N. Rajagopalan:
Unrolling the Shutter: CNN to Correct Motion Distortions. 2345-2353
Mark Sheinin, Yoav Y. Schechner, Kiriakos N. Kutulakos:
Computational Imaging on the Electric Grid. 2363-2372
Yannick Hold-Geoffroy, Kalyan Sunkavalli, Sunil Hadap, Emiliano Gambaretto, Jean-François Lalonde:
Deep Outdoor Illumination Estimation. 2373-2382
Viktor Larsson, Kalle Åström, Magnus Oskarsson:
Efficient Solvers for Minimal Problems by Syzygy-Based Reduction. 2383-2392
Julian Straub, Trevor Campbell, Jonathan P. How, John W. Fisher III:
Efficient Global Point Cloud Alignment Using Bayesian Nonparametric Mixtures. 2403-2412
Soumyadip Sengupta, Tal Amir, Meirav Galun, Tom Goldstein, David W. Jacobs, Amit Singer, Ronen Basri:
A New Rank Constraint on Multi-view Fundamental Matrices, and Its Application to Camera Location Recovery. 2413-2421
Angela Dai, Angel X. Chang, Manolis Savva, Maciej Halber, Thomas A. Funkhouser, Matthias Nießner:
ScanNet: Richly-Annotated 3D Reconstructions of Indoor Scenes. 2432-2443
Jaeheung Surh, Hae-Gon Jeon, Yunwon Park, Sunghoon Im, Hyowon Ha, In So Kweon:
Noise Robust Depth from Focus Using a Ring Difference Filter. 2444-2453
Luis Gonzalo Sánchez Giraldo, Erion Hasanbelliu, Murali Rao, José C. Príncipe:
Group-Wise Point-Set Registration Based on Rényi's Second Order Entropy. 2454-2462
Haoqiang Fan, Hao Su, Leonidas J. Guibas:
A Point Set Generation Network for 3D Object Reconstruction from a Single Image. 2463-2471
Gil Elbaz, Tamar Avraham, Anath Fischer:
3D Point Cloud Registration for Localization Using a Deep Neural Network Auto-Encoder. 2472-2481
Artem Rozantsev, Sudipta N. Sinha, Debadeepta Dey, Pascal Fua:
Flight Dynamics-Based Recovery of a UAV Trajectory Using Ground Cameras. 2482-2491
Eric Brachmann, Alexander Krull, Sebastian Nowozin, Jamie Shotton, Frank Michel, Stefan Gumhold, Carsten Rother:
DSAC - Differentiable RANSAC for Camera Localization. 2492-2500
Christian Mostegel, Rudolf Prettenthaler, Friedrich Fraundorfer, Horst Bischof:
Scalable Surface Reconstruction from Point Clouds with Extreme Scale and Density Diversity. 2501-2510
Amir Arsalan Soltani, Haibin Huang, Jiajun Wu, Tejas D. Kulkarni, Joshua B. Tenenbaum:
Synthesizing 3D Shapes via Modeling Multi-view Depth Maps and Silhouettes with Deep Generative Networks. 2511-2519
Matthew Trager, Bernd Sturmfels, John F. Canny, Martial Hebert, Jean Ponce:
General Models for Rational Cameras and the Case of Two-Slit Projections. 2520-2528
Michael Strecke, Anna Alperovich, Bastian Goldluecke:
Accurate Depth and Normal Maps from Occlusion-Aware Focal Stack Symmetry. 2529-2537
Thomas Schöps, Johannes L. Schönberger, Silvano Galliani, Torsten Sattler, Konrad Schindler, Marc Pollefeys, Andreas Geiger:
A Multi-view Stereo Benchmark with High-Resolution Images and Multi-camera Videos. 2538-2547
Hiroyuki Kayaba, Yuji Kokumai:
Non-contact Full Field Vibration Measurement Based on Phase-Shifting. 2548-2556
Daniel Barath, Tekla Toth, Levente Hajder:
A Minimal Solution for Two-View Focal-Length Estimation Using Two Affine Correspondences. 2557-2565
Alexander Krull, Eric Brachmann, Sebastian Nowozin, Frank Michel, Jamie Shotton, Carsten Rother:
PoseAgent: Budget-Constrained 6D Object Pose Estimation via Reinforcement Learning. 2566-2574
Mariano Jaimez, Thomas J. Cashman, Andrew W. Fitzgibbon, Javier Gonzalez-Jimenez, Daniel Cremers:
An Efficient Background Term for 3D Reconstruction and Tracking with Smooth Surface Models. 2575-2583
Shan Li, Weihong Deng, Junping Du:
Reliable Crowdsourcing and Deep Locality-Preserving Learning for Expression Recognition in the Wild. 2584-2593
César Roberto de Souza, Adrien Gaidon, Yohann Cabon, Antonio Manuel López Peña:
Procedural Generation of Videos to Train Deep Action Recognition Networks. 2594-2604
Shanxin Yuan, Qi Ye, Björn Stenger, Siddhant Jain, Tae-Kyun Kim:
BigHand2.2M Benchmark: Hand Pose Dataset and State of the Art Analysis. 2605-2613
Riza Alp Güler, George Trigeorgis, Epameinondas Antonakos, Patrick Snape, Stefanos Zafeiriou, Iasonas Kokkinos:
DenseReg: Fully Convolutional Dense Shape Regression In-the-Wild. 2614-2623
Jian-Xun Mi, Qiankun Fu, Weisheng Li:
Adaptive Class Preserving Representation for Image Classification. 2624-2632
Devraj Mandal, Kunal N. Chaudhury, Soma Biswas:
Generalized Semantic Preserving Hashing for N-Label Cross-Modal Retrieval. 2633-2641
Xinyu Zhou, Cong Yao, He Wen, Yuzhi Wang, Shuchang Zhou, Weiran He, Jiajun Liang:
EAST: An Efficient and Accurate Scene Text Detector. 2642-2651
Ronald Clark, Sen Wang, Andrew Markham, Niki Trigoni, Hongkai Wen:
VidLoc: A Deep Spatio-Temporal Model for 6-DoF Video-Clip Relocalization. 2652-2660
Dustin Morley, Hassan Foroosh:
Improving RANSAC-Based Segmentation through CNN Encapsulation. 2661-2670
Manikanta Kotaru, Sachin Katti:
Position Tracking for Virtual Reality Using Commodity WiFi. 2671-2681
Henryk Blasinski, Joyce E. Farrell, Brian A. Wandell:
Designing Illuminant Spectral Power Distributions for Surface Classification. 2682-2691
Tsuyoshi Takatani, Takahito Aoto, Yasuhiro Mukaigawa:
One-Shot Hyperspectral Imaging Using Faced Reflectors. 2692-2700
Kaimo Lin, Nianjuan Jiang, Shuaicheng Liu, Loong-Fah Cheong, Minh N. Do, Jiangbo Lu:
Direct Photometric Alignment by Mesh Deformation. 2701-2709
Christian Bailer, Kiran Varanasi, Didier Stricker:
CNN-Based Patch Matching for Optical Flow with Thresholded Hinge Embedding Loss. 2710-2719
Samuel Schulter, Paul Vernaza, Wongun Choi, Manmohan Chandraker:
Deep Network Flow for Multi-object Tracking. 2730-2739
Kenichiro Tanaka, Yasuhiro Mukaigawa, Takuya Funatomi, Hiroyuki Kubo, Yasuyuki Matsushita, Yasushi Yagi:
Material Classification Using Frequency-and Depth-Dependent Time-of-Flight Distortion. 2740-2749
Jinsun Park, Yu-Wing Tai, Donghyeon Cho, In So Kweon:
A Unified Approach of Multi-scale Deep and Hand-Crafted Features for Defocus Estimation. 2760-2769
Dongdong Chen, Lu Yuan, Jing Liao, Nenghai Yu, Gang Hua:
StyleBank: An Explicit Representation for Neural Image Style Transfer. 2770-2779
Chen Li, Stephen Lin, Kun Zhou, Katsushi Ikeuchi:
Specular Highlight Removal in Facial Images. 2780-2789
Ying Tai, Jian Yang, Xiaoming Liu:
Image Super-Resolution via Deep Recursive Residual Network. 2790-2798
Yi-Hsuan Tsai, Xiaohui Shen, Zhe Lin, Kalyan Sunkavalli, Xin Lu, Ming-Hsuan Yang:
Deep Image Harmonization. 2799-2807
Kai Zhang, Wangmeng Zuo, Shuhang Gu, Lei Zhang:
Learning Deep CNN Denoiser Prior for Image Restoration. 2808-2817
Tai-Xiang Jiang, Ting-Zhu Huang, Xi-Le Zhao, Liang-Jian Deng, Yao Wang:
A Novel Tensor-Based Video Rain Streaks Removal Approach via Utilizing Discriminatively Intrinsic Priors. 2818-2827
Jiawang Bian, Wen-Yan Lin, Yasuyuki Matsushita, Sai-Kit Yeung, Tan-Dat Nguyen, Ming-Ming Cheng:
GMS: Grid-Based Motion Statistics for Fast, Ultra-Robust Feature Correspondence. 2828-2837
Weihong Ren, Jiandong Tian, Zhi Han, Antoni Chan, Yandong Tang:
Video Desnowing and Deraining Based on Matrix Decomposition. 2838-2847
Jose Caballero, Christian Ledig, Andrew P. Aitken, Alejandro Acosta, Johannes Totz, Zehan Wang, Wenzhe Shi:
Real-Time Video Super-Resolution with Spatio-Temporal Networks and Motion Compensation. 2848-2857
David Novotný, Diane Larlus, Andrea Vedaldi:
AnchorNet: A Weakly Supervised Network to Learn Geometry-Sensitive Features for Semantic Matching. 2867-2876
Aditya Deshpande, Jiajun Lu, Mao-Chuang Yeh, Min Jin Chong, David A. Forsyth:
Learning Diverse Image Colorization. 2877-2885
Shuai Yang, Jiaying Liu, Zhouhui Lian, Zongming Guo:
Awesome Typography: Statistics-Based Text Effects Transfer. 2886-2895

Bohan Zhuang, Lingqiao Liu, Yao Li, Chunhua Shen, Ian D. Reid:
Attend in Groups: A Weakly-Supervised Deep Learning Framework for Learning from Web Data. 2915-2924
Heng Zhang, Vishal M. Patel, Rama Chellappa:
Hierarchical Multimodal Metric Learning for Multimodal Classification. 2925-2933
Thalaiyasingam Ajanthan, Alban Desmaison, Rudy Bunel, Mathieu Salzmann, Philip H. S. Torr, M. Pawan Kumar:
Efficient Linear Programming for Dense CRFs. 2934-2942
Youngjoon Yoo, Sangdoo Yun, Hyung Jin Chang, Yiannis Demiris, Jin Young Choi:
Variational Autoencoded Regression: High Dimensional Regression of Visual Data on Complex Manifold. 2943-2952
Paul Vernaza, Manmohan Chandraker:
Learning Random-Walk Label Propagation for Weakly-Supervised Semantic Segmentation. 2953-2961
Eric Tzeng, Judy Hoffman, Kate Saenko, Trevor Darrell:
Adversarial Discriminative Domain Adaptation. 2962-2971
Yongqiang Zhang, Daming Shi, Junbin Gao, Dansong Cheng:
Low-Rank-Sparse Subspace Representation for Robust Regression. 2972-2981
Behrooz Mahasseni, Michael Lam, Sinisa Todorovic:
Unsupervised Video Summarization with Adversarial LSTM Networks. 2982-2991

Li Zhang, Tao Xiang, Shaogang Gong:
Learning a Deep Embedding Model for Zero-Shot Learning. 3010-3019
Jacob Chan, Jimmy Addison Lee, Qian Kemao:
BIND: Binary Integrated Net Descriptors for Texture-Less Object Recognition. 3020-3028
Yu-Xiong Wang, Deva Ramanan, Martial Hebert:
Growing a Brain: Fine-Tuning by Increasing Model Capacity. 3029-3038
Xiaolong Wang, Abhinav Shrivastava, Abhinav Gupta:
A-Fast-RCNN: Hard Positive Generation via Adversary for Object Detection. 3039-3048
Yin Cui, Feng Zhou, Jiang Wang, Xiao Liu, Yuanqing Lin, Serge J. Belongie:
Kernel Pooling for Convolutional Neural Networks. 3049-3058
Peng Tang, Xinggang Wang, Xiang Bai, Wenyu Liu:
Multiple Instance Detection Network with Online Instance Classifier Refinement. 3059-3067
Amaia Salvador, Nicholas Hynes, Yusuf Aytar, Javier Marín, Ferda Ofli, Ingmar Weber, Antonio Torralba:
Learning Cross-Modal Embeddings for Cooking Recipes and Food Images. 3068-3076
Yongqin Xian, Bernt Schiele, Zeynep Akata:
Zero-Shot Learning - The Good, the Bad and the Ugly. 3077-3086
Danfei Xu, Yuke Zhu, Christopher B. Choy, Li Fei-Fei:
Scene Graph Generation by Iterative Message Passing. 3097-3106
Hanwang Zhang, Zawlin Kyaw, Shih-Fu Chang, Tat-Seng Chua:
Visual Translation Embedding Network for Visual Relation Detection. 3107-3115
Ronan Sicre, Yannis S. Avrithis, Ewa Kijak, Frédéric Jurie:
Unsupervised Part Learning for Visual Recognition. 3116-3124
Vasili Ramanishka, Abir Das, Jianming Zhang, Kate Saenko:
Top-Down Visual Saliency Guided by Captions. 3135-3144
Qiong Wang, Junbin Gao, Hong Li:
Grassmannian Manifold Optimization Assisted Sparse Spectral Clustering. 3145-3153
Rohit Girdhar, Deva Ramanan, Abhinav Gupta, Josef Sivic, Bryan Russell:
ActionVLAD: Learning Spatio-Temporal Aggregation for Action Classification. 3165-3174
Fabian Caba Heilbron, Wayner Barrios, Victor Escorcia, Bernard Ghanem:
SCC: Semantic Context Cascade for Efficient Action Detection. 3175-3184
Lorenzo Baraldi, Costantino Grana, Rita Cucchiara:
Hierarchical Boundary-Aware Neural Encoder for Video Captioning. 3185-3194
Tan Yu, Yuwei Wu, Junsong Yuan:
HOPE: Hierarchical Object Prototype Encoding for Efficient Object Instance Search in Videos. 3195-3204
Ionut Cosmin Duta, Bogdan Ionescu, Kiyoharu Aizawa, Nicu Sebe:
Spatio-Temporal Vector of Locally Max Pooled Features for Action Recognition in Videos. 3205-3214
Ze-Huan Yuan, Jonathan C. Stroud, Tong Lu, Jia Deng:
Temporal Action Localization by Structured Maximal Sums. 3215-3223
Yufan Liu, Songyang Zhang, Mai Xu, Xuming He:
Predicting Salient Face in Multiple-Face Videos. 3224-3232
Damien Teney, Lingqiao Liu, Anton van den Hengel:
Graph-Structured Representations for Visual Question Answering. 3233-3241
Jiasen Lu, Caiming Xiong, Devi Parikh, Richard Socher:
Knowing When to Look: Adaptive Attention via a Visual Sentinel for Image Captioning. 3242-3250
Hyo Jin Kim, Enrique Dunn, Jan-Michael Frahm:
Learned Contextual Feature Reweighting for Image Geo-Localization. 3251-3260
Youngjae Yu, Hyungjin Ko, Jongwook Choi, Gunhee Kim:
End-to-End Concept Word Detection for Video Captioning, Retrieval, and Question Answering. 3261-3269
Xuejian Rong, Chucai Yi, Yingli Tian:
Unambiguous Text Localization and Retrieval for Cluttered Scenes. 3279-3287
Jonathan Huang, Vivek Rathod, Chen Sun, Menglong Zhu, Anoop Korattikara, Alireza Fathi, Ian Fischer, Zbigniew Wojna, Yang Song, Sergio Guadarrama, Kevin Murphy:
Speed/Accuracy Trade-Offs for Modern Convolutional Object Detectors. 3296-3297
Bo Dai, Yuqi Zhang, Dahua Lin:
Detecting Visual Relationships with Deep Relational Networks. 3298-3308
Tobias Pohlen, Alexander Hermans, Markus Mathias, Bastian Leibe:
Full-Resolution Residual Networks for Semantic Segmentation in Street Scenes. 3309-3318
David Bau, Bolei Zhou, Aditya Khosla, Aude Oliva, Antonio Torralba:
Network Dissection: Quantifying Interpretability of Deep Visual Representations. 3319-3327
Mandar Dixit, Roland Kwitt, Marc Niethammer, Nuno Vasconcelos:
AGA: Attribute-Guided Augmentation. 3328-3336
Jonathan Krause, Justin Johnson, Ranjay Krishna, Li Fei-Fei:
A Hierarchical Approach for Generating Descriptive Image Paragraphs. 3337-3345
Liang Zheng, Hengheng Zhang, Shaoyan Sun, Manmohan Chandraker, Yi Yang, Qi Tian:
Person Re-identification in the Wild. 3346-3355
Xiaolong Wang, Rohit Girdhar, Abhinav Gupta:
Binge Watching: Scaling Affordance Learning from Sitcoms. 3366-3375
Tong Xiao, Shuang Li, Bochao Wang, Liang Lin, Xiaogang Wang:
Joint Detection and Identification Feature Learning for Person Search. 3376-3385
Forrester Cole, David Belanger, Dilip Krishnan, Aaron Sarna, Inbar Mosseri, William T. Freeman:
Synthesizing Normalized Faces from Facial Identity Features. 3386-3395
Ji Lin, Liangliang Ren, Jiwen Lu, Jianjiang Feng, Jie Zhou:
Consistent-Aware Deep Learning for Person Re-identification in a Camera Network. 3396-3405
Aaron Nech, Ira Kemelmacher-Shlizerman:
Level Playing Field for Million Scale Face Recognition. 3406-3415
Oscar Koller, Sepehr Zargaran, Hermann Ney:
Re-Sign: Re-Aligned End-to-End Sequence Modelling with Deep Recurrent CNN-HMMs. 3416-3424
Timur M. Bagautdinov, Alexandre Alahi, François Fleuret, Pascal Fua, Silvio Savarese:
Social Scene Understanding: End-to-End Multi-person Action Localization and Collective Activity Recognition. 3425-3434
Hao Jiang, Kristen Grauman:
Detangling People: Individuating Multiple Close People and Their Body Parts via Region Assembly. 3435-3443
Joon Son Chung, Andrew W. Senior, Oriol Vinyals, Andrew Zisserman:
Lip Reading Sentences in the Wild. 3444-3453
Yuliang Liu, Lianwen Jin:
Deep Matching Prior Network: Toward Tighter Multi-oriented Text Detection. 3454-3461
Xiaosong Wang, Yifan Peng, Le Lu, Zhiyong Lu, Mohammadhadi Bagheri, Ronald M. Summers:
ChestX-Ray8: Hospital-Scale Chest X-Ray Database and Benchmarks on Weakly-Supervised Classification and Localization of Common Thorax Diseases. 3462-3471
Siavash Gorji, James J. Clark:
Attentional Push: A Deep Convolutional Network for Augmenting Image Salience with Shared Attention Modeling in Social Scenes. 3472-3481
Baoguang Shi, Xiang Bai, Serge J. Belongie:
Detecting Oriented Text in Natural Images by Linking Segments. 3482-3490
Federico Perazzi, Anna Khoreva, Rodrigo Benenson, Bernt Schiele, Alexander Sorkine-Hornung:
Learning Video Object Segmentation from Static Images. 3491-3500
Hao Jiang, Kristen Grauman:
Seeing Invisible Poses: Estimating 3D Body Pose from Egocentric Video. 3501-3509
Anh Nguyen, Jeff Clune, Yoshua Bengio, Alexey Dosovitskiy, Jason Yosinski:
Plug & Play Generative Networks: Conditional Iterative Generation of Images in Latent Space. 3510-3520
Licheng Yu, Hao Tan, Mohit Bansal, Tamara L. Berg:
A Joint Speaker-Listener-Reinforcer Model for Referring Expressions. 3521-3529
Huazhe Xu, Yang Gao, Fisher Yu, Trevor Darrell:
End-to-End Learning of Driving Models from Large-Scale Video Datasets. 3530-3538
Mengmi Zhang, Keng Teck Ma, Joo-Hwee Lim, Qi Zhao, Jiashi Feng:
Deep Future Gaze: Gaze Anticipation on Egocentric Videos Using Adversarial Networks. 3539-3548
Zizhao Zhang, Yuanpu Xie, Fuyong Xing, Mason McGough, Lin Yang:
MDNet: A Semantically and Visually Interpretable Medical Image Diagnosis Network. 3549-3557
Adnane Boukhayma, Jean-Sébastien Franco, Edmond Boyer:
Surface Motion Capture Transfer with Gaussian Process Regression. 3558-3566
Jingming Dong, Xiaohan Fei, Stefano Soatto:
Visual-Inertial-Semantic Scene Representation for 3D Object Detection. 3567-3577
Nazim Haouchine, Stephane Cotin:
Template-Based Monocular 3D Recovery of Elastic Shapes Using Lagrangian Multipliers. 3578-3586
Dingwen Zhang, Junwei Han, Yang Yang, Dong Huang:
Learning Category-Specific 3D Shape Models from Weakly Labeled 2D Images. 3587-3595
Marjan Shahpaski, Luis Ricardo Sapaico, Gaspard Chevassus, Sabine Süsstrunk:
Simultaneous Geometric and Radiometric Calibration of a Projector-Camera Pair. 3596-3604
Zuzana Kukelova, Joe Kileel, Bernd Sturmfels, Tomás Pajdla:
A Clever Elimination Strategy for Efficient Minimal Solvers. 3605-3614
Jin Xie, Guoxian Dai, Fan Zhu, Yi Fang:
Learning Barycentric Representations of 3D Shapes for Sketch-Based 3D Shape Retrieval. 3615-3623
Hongsong Wang, Liang Wang:
Modeling Temporal Dynamics and Spatial Configurations of Actions Using Two-Stream Recurrent Neural Networks. 3633-3642
Yu-Wei Chao, Jimei Yang, Brian L. Price, Scott Cohen, Jia Deng:
Forecasting Human Dynamics from Static Images. 3643-3651
Zhun Zhong, Liang Zheng, Donglin Cao, Shaozi Li:
Re-ranking Person Re-identification with k-Reciprocal Encoding. 3652-3661
Jun Liu, Gang Wang, Ping Hu, Ling-Yu Duan, Alex C. Kot:
Global Context-Aware Attention LSTM Networks for 3D Action Recognition. 3671-3680
Zhen-Hua Feng, Josef Kittler, William J. Christmas, Patrik Huber, Xiaojun Wu:
Dynamic Attention-Controlled Cascaded Shape Regression Exploiting Training Data Augmentation and Fuzzy-Set Sample Weighting. 3681-3690
Jiang-Jing Lv, Xiaohu Shao, Junliang Xing, Cheng Cheng, Xi Zhou:
A Deep Regression Architecture with Two-Stage Re-initialization for High Performance Facial Landmark Detection. 3691-3700
Siyu Tang, Mykhaylo Andriluka, Bjoern Andres, Bernt Schiele:
Multiple People Tracking by Lifted Multicut and Person Re-identification. 3701-3710
George Papandreou, Tyler Zhu, Nori Kanazawa, Alexander Toshev, Jonathan Tompson, Chris Bregler, Kevin Murphy:
Towards Accurate Multi-person Pose Estimation in the Wild. 3711-3719
Vamsi Kiran Adhikarla, Marek Vinkler, Denis Sumin, Rafal K. Mantiuk, Karol Myszkowski, Hans-Peter Seidel, Piotr Didyk:
Towards a Quality Metric for Dense Light Fields. 3720-3729
Leon A. Gatys, Alexander S. Ecker, Matthias Bethge, Aaron Hertzmann, Eli Shechtman:
Controlling Perceptual Factors in Neural Style Transfer. 3730-3738
Kuan-Lun Tseng, Yen-Liang Lin, Winston H. Hsu, Chung-Yang Huang:
Joint Sequence Learning and Cross-Modality Convolution for 3D Biomedical Segmentation. 3739-3746
Biagio Brattoli, Uta Büchler, Anna-Sophia Wahl, Martin E. Schwab, Björn Ommer:
LSTM Self-Supervision for Detailed Behavior Analysis. 3747-3756
Donald G. Dansereau, Glenn Schuster, Joseph Ford, Gordon Wetzstein:
A Wide-Field-of-View Monocentric Light Field Camera. 3757-3766
Che-Han Chang, Chun-Nan Chou, Edward Y. Chang:
CLKN: Cascaded Lucas-Kanade Networks for Image Alignment. 3777-3785
Jeany Son, Mooyeol Baek, Minsu Cho, Bohyung Han:
Multi-object Tracking with Quadruplet Convolutional Neural Networks. 3786-3795
Lijun Wang, Huchuan Lu, Yifan Wang, Mengyang Feng, Dong Wang, Baocai Yin, Xiang Ruan:
Learning to Detect Salient Objects with Image-Level Supervision. 3796-3805
Dong Gong, Jie Yang, Lingqiao Liu, Yanning Zhang, Ian D. Reid, Chunhua Shen, Anton van den Hengel, Qinfeng Shi:
From Motion Blur to Motion Flow: A Deep Learning Solution for Removing Heterogeneous Motion Blur. 3806-3815
Hongteng Xu, Junchi Yan, Nils Persson, Weiyao Lin, Hongyuan Zha:
Fractal Dimension Invariant Filtering and Its CNN-Based Implementation. 3825-3833
Tatsuya Yokota, Hidekata Hontani:
Simultaneous Visual Data Completion and Denoising Based on Tensor Rank and Total Variation Minimization and Its Primal-Dual Splitting Algorithm. 3843-3851
Vassileios Balntas, Karel Lenc, Andrea Vedaldi, Krystian Mikolajczyk:
HPatches: A Benchmark and Evaluation of Handcrafted and Learned Local Descriptors. 3852-3861
Renwei Dian, Leyuan Fang, Shutao Li:
Hyperspectral Image Super-Resolution via Non-local Sparse Tensor Factorization. 3862-3871
Koteswar Rao Jerripothula, Jianfei Cai, Jiangbo Lu, Junsong Yuan:
Object Co-skeletonization with Co-segmentation. 3881-3889
Quanshi Zhang, Ruiming Cao, Ying Nian Wu, Song-Chun Zhu:
Mining Object Parts from CNNs via Active Question-Answering. 3890-3899
Xingcheng Zhang, Zhizhong Li, Chen Change Loy, Dahua Lin:
PolyNet: A Pursuit of Structural Diversity in Very Deep Networks. 3900-3908
Peng Wang, Qi Wu, Chunhua Shen, Anton van den Hengel:
The VQA-Machine: Learning How to Use Existing Vision Algorithms to Answer New Questions. 3909-3918
Naveed Akhtar, Ajmal S. Mian, Fatih Porikli:
Joint Discriminative Bayesian Dictionary and Classifier Learning. 3919-3928
Nikolay Savinov, Akihito Seki, Lubor Ladicky, Torsten Sattler, Marc Pollefeys:
Quad-Networks: Unsupervised Learning to Rank for Interest Point Detection. 3929-3937
Zhen Wei, Yao Sun, Jinqiao Wang, Hanjiang Lai, Si Liu:
Learning Adaptive Receptive Fields for Deep Image Parsing Network. 3947-3955
Samitha Herath, Mehrtash Tafazzoli Harandi, Fatih Porikli:
Learning an Invariant Hilbert Space for Domain Adaptation. 3956-3965
Jayakorn Vongkulbhisal, Fernando De la Torre, João Paulo Costeira:
Discriminative Optimization: Theory and Applications to Point Cloud Registration. 3975-3983
Yiling Wu, Shuhui Wang, Qingming Huang:
Online Asymmetric Similarity Learning for Cross-Modal Retrieval. 3984-3993
Kui Jia, Dacheng Tao, Shenghua Gao, Xiangmin Xu:
Improving Training of Deep Neural Networks via Singular Value Bounding. 3994-4002
Shuangfei Zhai, Hui Wu, Abhishek Kumar, Yu Cheng, Yongxi Lu, Zhongfei Zhang, Rogério Schmidt Feris:
S3Pool: Pooling with Stochastic Spatial Sampling. 4003-4011
Namdar Homayounfar, Sanja Fidler, Raquel Urtasun:
Sports Field Localization via Deep Structured Models. 4012-4020
Binghui Chen, Weihong Deng, Junping Du:
Noisy Softmax: Improving the Generalization Ability of DCNN via Postponing the Early Softmax Saturation. 4021-4030
Deepak Babu Sam, Shiv Surya, R. Venkatesh Babu:
Switching Convolutional Neural Network for Crowd Counting. 4031-4039
Yiwen Guo, Anbang Yao, Hao Zhao, Yurong Chen:
Network Sketching: Exploiting Binary Structure in Deep CNNs. 4040-4048
Xiaoqiang Yan, Shizhe Hu, Yangdong Ye:
Multi-task Clustering of Human Actions by Sharing Information. 4049-4057

Chao Yang, Xin Lu, Zhe Lin, Eli Shechtman, Oliver Wang, Hao Li:
High-Resolution Image Inpainting Using Multi-scale Neural Patch Synthesis. 4076-4084
Zhaofan Qiu, Ting Yao, Tao Mei:
Deep Quantization: Encoding Convolutional Activations with Deep Generative Model. 4085-4094
Jose Dolz, Ismail Ben Ayed, Christian Desrosiers:
DOPE: Distributed Optimization for Pairwise Energies. 4095-4104
Dmitry Ulyanov, Andrea Vedaldi, Victor S. Lempitsky:
Improved Texture Networks: Maximizing Quality and Diversity in Feed-Forward Stylization and Texture Synthesis. 4105-4113
Hakan Cevikalp, Bill Triggs:
Polyhedral Conic Classifiers for Visual Object Detection and Classification. 4114-4122
Juncheng Liu, Zhouhui Lian, Yi Wang, Jianguo Xiao:
Incremental Kernel Null Space Discriminant Analysis for Novelty Detection. 4123-4131
Menghua Zhai, Zachary Bessinger, Scott Workman, Nathan Jacobs:
Predicting Ground-Level Scene Layout from Aerial Imagery. 4132-4140
Xizhou Zhu, Yuwen Xiong, Jifeng Dai, Lu Yuan, Yichen Wei:
Deep Feature Flow for Video Recognition. 4141-4150
Fan Yang, Xin Li, Hong Cheng, Jianping Li, Leiting Chen:
Object-Aware Dense Semantic Correspondence. 4151-4159
Feng Liu, Tao Xiang, Timothy M. Hospedales, Wankou Yang, Changyin Sun:
Semantic Regularisation for Recurrent Image Annotation. 4160-4168
Zhi-Qi Cheng, Xiao Wu, Yang Liu, Xian-Sheng Hua:
Video2Shop: Exact Matching Clothes in Videos to Online Shopping Images. 4169-4177
Seyed A. Esmaeili, Bharat Singh, Larry S. Davis:
Fast-At: Fast Automatic Thumbnail Generation Using Deep Neural Networks. 4178-4186
Dongfei Yu, Jianlong Fu, Tao Mei, Yong Rui:
Multi-level Attention Networks for Visual Question Answering. 4187-4195
Anna Rohrbach, Marcus Rohrbach, Siyu Tang, Seong Joon Oh, Bernt Schiele:
Generating Descriptions with Grounded and Co-referenced People. 4196-4206
Saumya Jetley, Michael Sapienza, Stuart Golodetz, Philip H. S. Torr:
Straight to Shapes: Real-Time Detection of Encoded Shapes. 4207-4216
Thanh-Toan Do, Dang-Khoa Le Tan, Trung T. Pham, Ngai-Man Cheung:
Simultaneous Feature Aggregating and Hashing for Large-Scale Image Search. 4217-4226
Mahdi M. Kalayeh, Boqing Gong, Mubarak Shah:
Improving Facial Attribute Prediction Using Semantic Segmentation. 4227-4235
Dan Xu, Wanli Ouyang, Elisa Ricci, Xiaogang Wang, Nicu Sebe:
Learning Cross-Modal Deep Representations for Robust Pedestrian Detection. 4236-4244
Yang Du, Chunfeng Yuan, Bing Li, Weiming Hu, Stephen J. Maybank:
Spatio-Temporal Self-Organizing Map Deep Network for Dynamic Object Detection from Videos. 4245-4254
Tianmin Shu, Sinisa Todorovic, Song-Chun Zhu:
CERN: Confidence-Energy Recurrent Network for Group Activity Recognition. 4255-4263
Shanghang Zhang, Guanhang Wu, João P. Costeira, José M. F. Moura:
Understanding Traffic Density from Large-Scale Web Camera Data. 4264-4273
Rameswar Panda, Amit K. Roy-Chowdhury:
Collaborative Summarization of Topic-Related Videos. 4274-4283
Felix Juefei-Xu, Vishnu Naresh Boddeti, Marios Savvides:
Local Binary Convolutional Neural Networks. 4284-4293
Zequn Jie, Yunchao Wei, Xiaojie Jin, Jiashi Feng, Wei Liu:
Deep Self-Taught Learning for Weakly Supervised Object Localization. 4294-4302
Pierre Baqué, François Fleuret, Pascal Fua:
Multi-modal Mean-Fields via Cardinality-Based Clamping. 4303-4312
Chong You, Daniel P. Robinson, René Vidal:
Provable Self-Representation Based Outlier Detection in a Union of Subspaces. 4323-4332
Changqing Zhang, Qinghua Hu, Huazhu Fu, Pengfei Zhu, Xiaochun Cao:
Latent Multi-view Subspace Clustering. 4333-4341
Xiao Yang, Ersin Yumer, Paul Asente, Mike Kraley, Daniel Kifer, C. Lee Giles:
Learning to Extract Semantic Structure from Documents Using Multimodal Fully Convolutional Neural Networks. 4342-4351
Zhifei Zhang, Yang Song, Hairong Qi:
Age Progression/Regression by Conditional Adversarial Autoencoder. 4352-4360

Yuqian Zhang, Yenson Lau, Han-Wen Kuo, Sky Cheung, Abhay Pasupathy, John Wright:
On the Global Geometry of Sphere-Constrained Sparse Blind Deconvolution. 4381-4389
Changqun Xia, Jia Li, Xiaowu Chen, Anlin Zheng, Yu Zhang:
What is and What is Not a Salient Object? Learning Salient Object Detector by Ensembling Linear Exemplar Regressors. 4399-4407
Xiaodan Liang, Lisa Lee, Eric P. Xing:
Deep Variation-Structured Reinforcement Learning for Visual Relationship and Attribute Detection. 4408-4417
Ronghang Hu, Marcus Rohrbach, Jacob Andreas, Trevor Darrell, Kate Saenko:
Modeling Relationships in Referential Expressions with Compositional Modular Networks. 4418-4427
Prithvijit Chattopadhyay, Ramakrishna Vedantam, Ramprasaath R. Selvaraju, Dhruv Batra, Devi Parikh:
Counting Everyday Objects in Everyday Scenes. 4428-4437
Yi Li, Haozhi Qi, Jifeng Dai, Xiangyang Ji, Yichen Wei:
Fully Convolutional Instance-Aware Semantic Segmentation. 4438-4446
Shanshan Zhang, Rodrigo Benenson, Bernt Schiele:
CityPersons: A Diverse Dataset for Pedestrian Detection. 4457-4465
Harm de Vries, Florian Strub, Sarath Chandar, Olivier Pietquin, Hugo Larochelle, Aaron C. Courville:
GuessWhat?! Visual Object Discovery through Multi-modal Dialogue. 4466-4475
Jianlong Fu, Heliang Zheng, Tao Mei:
Look Closer to See Better: Recurrent Attention Convolutional Neural Network for Fine-Grained Image Recognition. 4476-4484
Lluis Castrejon, Kaustav Kundu, Raquel Urtasun, Sanja Fidler:
Annotating Object Instances with a Polygon-RNN. 4485-4493
Wenzhen Yuan, Shaoxiong Wang, Siyuan Dong, Edward H. Adelson:
Connecting Look and Feel: Associating the Visual and Tactile Properties of Physical Materials. 4494-4502
Concetto Spampinato, Simone Palazzo, Isaak Kavasidis, Daniela Giordano, Nasim Souly, Mubarak Shah:
Deep Learning Human Mind for Automated Visual Classification. 4503-4511
Eisuke Ito, Takayuki Okatani:
Self-Calibration-Based Approach to Critical Motion Sequences of Rolling-Shutter Structure from Motion. 4512-4520
Fotios Logothetis, Roberto Mecca, Roberto Cipolla:
Semi-Calibrated Near Field Photometric Stereo. 4521-4530
Ali Osman Ulusoy, Michael J. Black, Andreas Geiger:
Semantic Multi-view Stereo: Jointly Estimating Objects and Voxels. 4531-4540
Matteo Poggi, Stefano Mattoccia:
Learning to Predict Stereo Reliability Enforcing Local Consistency of Confidence Maps. 4541-4550
Tobias Palmér, Kalle Åström, Jan-Michael Frahm:
The Misty Three Point Algorithm for Relative Pose. 4551-4559
Anil Usumezbas, Ricardo Fabbri, Benjamin B. Kimia:
The Surfacing of Multiview 3D Drawings via Lofting and Occlusion Reasoning. 4560-4569
Qiuhong Ke, Mohammed Bennamoun, Senjian An, Ferdous Ahmed Sohel, Farid Boussaïd:
A New Representation of Skeleton Sequences for 3D Action Recognition. 4570-4579
Irene Kaltenmark, Benjamin Charlier, Nicolas Charon:
A General Framework for Curve and Surface Comparison and Registration with Oriented Varifolds. 4580-4589
Anil Armagan, Martin Hirzer, Peter M. Roth, Vincent Lepetit:
Learning to Align Semantic Segmentation and 2.5D Maps for Geolocalization. 4590-4597
Lu Sheng, Jianfei Cai, Tat-Jen Cham, Vladimir Pavlovic, King Ngi Ngan:
A Generative Model for Depth-Based Robust 3D Facial Pose Tracking. 4598-4607
Fabio Maninchedda, Martin R. Oswald, Marc Pollefeys:
Fast 3D Reconstruction of Faces with Glasses. 4608-4617
Tong Ke, Stergios I. Roumeliotis:
An Efficient Algebraic Solution to the Perspective-Three-Point Problem. 4618-4626
Gül Varol, Javier Romero, Xavier Martin, Naureen Mahmood, Michael J. Black, Ivan Laptev, Cordelia Schmid:
Learning from Synthetic Humans. 4627-4635
Wei-Chiu Ma, De-An Huang, Namhoon Lee, Kris M. Kitani:
Forecasting Interactive Dynamics of Pedestrians with Fictitious Play. 4636-4644
Tomas Simon, Hanbyul Joo, Iain A. Matthews, Yaser Sheikh:
Hand Keypoint Detection in Single Images Using Multiview Bootstrapping. 4645-4653
Umar Iqbal, Anton Milan, Juergen Gall:
PoseTrack: Joint Multi-person Pose Estimation and Tracking. 4654-4663
Shiyu Huang, Deva Ramanan:
Expecting the Unexpected: Training Detectors for Unusual Pedestrians with Adversarial Imposters. 4664-4673
Julieta Martinez, Michael J. Black, Javier Romero:
On Human Motion Prediction Using Recurrent Neural Networks. 4674-4683
Zhiyuan Shi, Tae-Kyun Kim:
Learning and Refining of Privileged Information-Based RNNs for Action Recognition from Depth Sequences. 4684-4693
Christoph Lassner, Javier Romero, Martin Kiefel, Federica Bogo, Michael J. Black, Peter V. Gehler:
Unite the People: Closing the Loop Between 3D and 2D Human Representations. 4704-4713
Alin-Ionut Popa, Mihai Zanfir, Cristian Sminchisescu:
Deep Multitask Architecture for Integrated 2D and 3D Human Sensing. 4714-4723
Joao Carreira, Andrew Zisserman:
Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset. 4724-4733
Chenyou Fan, Jangwon Lee, Mingze Xu, Krishna Kumar Singh, Yong Jae Lee, David J. Crandall, Michael S. Ryoo:
Identifying First-Person Camera Wearers in Third-Person Videos. 4734-4742
Victor Yurchenko, Victor S. Lempitsky:
Parsing Images of Overlapping Organisms with Deep Singling-Out Networks. 4752-4760
Zongwei Zhou, Jae Y. Shin, Lei Zhang, Suryakanth R. Gurudu, Michael B. Gotway, Jianming Liang:
Fine-Tuning Convolutional Neural Networks for Biomedical Image Analysis: Actively and Incrementally. 4761-4772
Huixuan Tang, Scott Cohen, Brian L. Price, Stephen Schiller, Kiriakos N. Kutulakos:
Depth from Defocus in the Wild. 4773-4781
Chao Liu, Srinivasa G. Narasimhan, Artur W. Dubrawski:
Matting and Depth Recovery of Thin Structures Using a Focal Stack. 4782-4790
Yinlin Hu, Yunsong Li, Rui Song:
Robust Interpolation of Correspondences for Large Displacement Optical Flow. 4791-4799
Mengmeng Wang, Yong Liu, Zeyi Huang:
Large Margin Object Tracking with Circulant Feature Maps. 4800-4808
Tianzhu Zhang, Changsheng Xu, Ming-Hsuan Yang:
Multi-task Correlation Particle Filter for Robust Object Tracking. 4819-4827
Jongwon Choi, Hyung Jin Chang, Sangdoo Yun, Tobias Fischer, Yiannis Demiris, Jin Young Choi:
Attentional Correlation Filter Network for Adaptive Visual Tracking. 4828-4837
Denys Rozumnyi, Jan Kotera, Filip Sroubek, Lukás Novotný, Jiri Matas:
The World of Fast Moving Objects. 4838-4846
Alan Lukezic, Tomas Vojir, Luka Cehovin Zajc, Jiri Matas, Matej Kristan:
Discriminative Correlation Filter with Channel and Spatial Reliability. 4847-4856
Yueqi Duan, Jiwen Lu, Ziwei Wang, Jianjiang Feng, Jie Zhou:
Learning Deep Binary Descriptor with Multi-quantization. 4857-4866
Jun Guo, Hongyang Chao:
One-To-Many Network for Visually Pleasing Compression Artifacts Reduction. 4867-4876
Md. Amirul Islam, Mrigank Rochan, Neil D. B. Bruce, Yang Wang:
Gated Feedback Refinement Network for Dense Image Labeling. 4877-4885
Hao Guan, William A. P. Smith:
BRISKS: Binary Features for Spherical Images on a Geodesic Grid. 4886-4894
Radhakrishna Achanta, Sabine Süsstrunk:
Superpixels and Polygons Using Simple Non-iterative Clustering. 4895-4904
Longquan Dai, Mengke Yuan, Zechao Li, Xiaopeng Zhang, Jinhui Tang:
Hardware-Efficient Guided Image Filtering for Multi-label Problem. 4905-4913
Xu Zhang, Felix X. Yu, Svebor Karaman, Shih-Fu Chang:
Learning Discriminative and Transformation Covariant Local Feature Detectors. 4923-4931
Chengjiang Long, Gang Hua:
Correlational Gaussian Processes for Cross-Domain Visual Recognition. 4932-4940
Swaminathan Gurumurthy, Ravi Kiran Sarvadevabhatla, R. Venkatesh Babu:
DeLiGAN: Generative Adversarial Networks for Diverse and Limited Data. 4941-4949
Paul Swoboda, Jan Kuske, Bogdan Savchynskyy:
A Dual Ascent Framework for Lagrangean Decomposition of Combinatorial Problems. 4950-4960
Luan Tran, Xiaoming Liu, Jiayu Zhou, Rong Jin:
Missing Modalities Imputation via Cascaded Residual Autoencoder. 4971-4980
Hossam N. Isack, Olga Veksler, Ipek Oguz, Milan Sonka, Yuri Boykov:
Efficient Optimization for Hierarchically-Structured Interacting Segments (HINTS). 4981-4989
Paul Swoboda, Bjoern Andres:
A Message Passing Algorithm for the Minimum Cost Multicut Problem. 4990-4999
Jack Valmadre, Luca Bertinetto, João F. Henriques, Andrea Vedaldi, Philip H. S. Torr:
End-to-End Representation Learning for Correlation Filter Based Tracking. 5000-5008
Sathya N. Ravi, Yunyang Xiong, Lopamudra Mukherjee, Vikas Singh:
Filter Flow Made Practical: Massively Parallel and Lock-Free. 5009-5018
Won Hwa Kim, Mona Jalal, Seong Jae Hwang, Sterling C. Johnson, Vikas Singh:
Online Graph Completion: Multivariate Signal Recovery in Computer Vision. 5019-5027
Sanping Zhou, Jinjun Wang, Jiayun Wang, Yihong Gong, Nanning Zheng:
Point to Set Similarity Based Deep Feature Learning for Person Re-Identification. 5028-5037
Seong Joon Oh, Rodrigo Benenson, Anna Khoreva, Zeynep Akata, Mario Fritz, Bernt Schiele:
Exploiting Saliency for Object Segmentation from Image Level Labels. 5038-5047
Pablo Speciale, Danda Pani Paudel, Martin R. Oswald, Till Kroeger, Luc Van Gool, Marc Pollefeys:
Consensus Maximization with Linear Matrix Inequality Constraints. 5048-5056
Yinda Zhang, Shuran Song, Ersin Yumer, Manolis Savva, Joon-Young Lee, Hailin Jin, Thomas A. Funkhouser:
Physically-Based Rendering for Indoor Scene Understanding Using Convolutional Neural Networks. 5057-5065
Xitong Yang, Palghat Ramesh, Radha Chitta, Sriganesh Madhvanath, Edgar A. Bernal, Jiebo Luo:
Deep Multimodal Representation Learning from Temporal Data. 5066-5074
Di Xie, Jiang Xiong, Shiliang Pu:
All You Need is Beyond a Good Init: Exploring Better Solution for Training Extremely Deep Convolutional Neural Networks with Orthonormality and Modulation. 5075-5084
Sam Gross, Marc'Aurelio Ranzato, Arthur Szlam:
Hard Mixtures of Experts for Large Scale Weakly Supervised Vision. 5085-5093
Mustafa Devrim Kaba, Mustafa Gökhan Uzunbas, Ser-Nam Lim:
A Reinforcement Learning Approach to the View Planning Problem. 5094-5102
Meng Ye, Yuhong Guo:
Zero-Shot Classification with Discriminative Semantic Representation Learning. 5103-5111
Ziad Al-Halah, Rainer Stiefelhagen:
Automatic Discovery, Association Estimation and Learning of Semantic Attributes for a Thousand Categories. 5112-5121
Bolei Zhou, Hang Zhao, Xavier Puig, Sanja Fidler, Adela Barriuso, Antonio Torralba:
Scene Parsing through ADE20K Dataset. 5122-5130
Ali Diba, Vivek Sharma, Ali Mohammad Pazandeh, Hamed Pirsiavash, Luc Van Gool:
Weakly Supervised Cascaded Convolutional Networks. 5131-5139
Li Liu, Ling Shao, Fumin Shen, Mengyang Yu:
Discretely Coding Semantic Rank Orders for Supervised Image Hashing. 5140-5149
Jing Zhang, Wanqing Li, Philip Ogunbona:
Joint Geometrical and Statistical Alignment for Visual Domain Adaptation. 5150-5158
Zhiqiang Shen, Jianguo Li, Zhou Su, Minjun Li, Yurong Chen, Yu-Gang Jiang, Xiangyang Xue:
Weakly Supervised Dense Video Captioning. 5159-5167
Guosheng Lin, Anton Milan, Chunhua Shen, Ian D. Reid:
RefineNet: Multi-path Refinement Networks for High-Resolution Semantic Segmentation. 5168-5177
Falong Shen, Rui Gan, Shuicheng Yan, Gang Zeng:
Semantic Segmentation via Structured Patch Prediction, Context CRF and Guidance CRF. 5178-5186
Shuang Li, Tong Xiao, Hongsheng Li, Bolei Zhou, Dayu Yue, Xiaogang Wang:
Person Search with Natural Language Description. 5187-5196
Johann Sawatzky, Abhilash Srikantha, Juergen Gall:
Weakly Supervised Affordance Detection. 5197-5206
Yanan Li, Donghui Wang, Huanhang Hu, Yuetan Lin, Yueting Zhuang:
Zero-Shot Recognition Using Dual Visual-Semantic Mapping Paths. 5207-5215
Jiaolong Yang, Peiran Ren, Dongqing Zhang, Dong Chen, Fang Wen, Hongdong Li, Gang Hua:
Neural Aggregation Network for Video Face Recognition. 5216-5225
Ji Zhang, Mohamed Elhoseiny, Scott Cohen, Walter Chang, Ahmed M. Elgammal:
Relationship Proposal Networks. 5226-5234
Guangrun Wang, Ping Luo, Liang Lin, Xiaogang Wang:
Learning Object Interactions and Descriptions for Semantic Image Segmentation. 5235-5243
Tao Kong, Fuchun Sun, Anbang Yao, Huaping Liu, Ming Lu, Yurong Chen:
RON: Reverse Connection with Objectness Prior Networks for Object Detection. 5244-5252
Fanyi Xiao, Leonid Sigal, Yong Jae Lee:
Weakly-Supervised Visual Grounding of Phrases with Linguistic Structures. 5253-5262
Ting Yao, Yingwei Pan, Yehao Li, Tao Mei:
Incorporating Copying Mechanism in Image Captioning for Learning Novel Objects. 5263-5271
Albert Gordo, Diane Larlus:
Beyond Instance-Level Image Retrieval: Leveraging Captions to Learn a Global Visual Representation for Semantic Retrieval. 5272-5281
Youssef Tamaazousti, Hervé Le Borgne, Céline Hudelot:
MuCaLe-Net: Multi Categorical-Level Networks to Generate More Discriminating Features. 5282-5291
Shay Deutsch, Soheil Kolouri, Kyungnam Kim, Yuri Owechko, Stefano Soatto:
Zero Shot Learning via Multi-scale Manifold Regularization. 5292-5299
Qibin Hou, Ming-Ming Cheng, Xiaowei Hu, Ali Borji, Zhuowen Tu, Philip H. S. Torr:
Deeply Supervised Salient Object Detection with Short Connections. 5300-5309
Ganzhao Yuan, Wei-Shi Zheng, Bernard Ghanem:
A Matrix Splitting Method for Composite Function Minimization. 5310-5319
Sergi Caelles, Kevis-Kokitsi Maninis, Jordi Pont-Tuset, Laura Leal-Taixé, Daniel Cremers, Luc Van Gool:
One-Shot Video Object Segmentation. 5320-5329
Jiaxin Chen, Yunhong Wang, Jie Qin, Li Liu, Ling Shao:
Fast Person Re-identification via Cross-Camera Semantic Binary Transformation. 5330-5339
Dingwen Zhang, Le Yang, Deyu Meng, Dong Xu, Junwei Han:
SPFTN: A Self-Paced Fine-Tuning Network for Segmenting Objects in Weakly Labelled Videos. 5340-5348
Zhongwen Xu, Linchao Zhu, Yi Yang:
Few-Shot Object Recognition from Machine-Labeled Web Images. 5358-5366
Xin Yu, Fatih Porikli:
Hallucinating Very Low-Resolution Unaligned and Noisy Face Images by Transformative Discriminative Autoencoders. 5367-5375
Aniruddha Kembhavi, Min Joon Seo, Dustin Schwenk, Jonghyun Choi, Ali Farhadi, Hannaneh Hajishirzi:
Are You Smarter Than a Sixth Grader? Textbook Question Answering for Multimodal Machine Comprehension. 5376-5384
Hemanth Venkateswara, Jose Eusebio, Shayok Chakraborty, Sethuraman Panchanathan:
Deep Hashing Network for Unsupervised Domain Adaptation. 5385-5394
Venkataraman Santhanam, Vlad I. Morariu, Larry S. Davis:
Generalized Deep Image to Image Regression. 5395-5405
Zhaowei Cai, Xiaodong He, Jian Sun, Nuno Vasconcelos:
Deep Learning with Low Precision by Half-Wave Gaussian Quantization. 5406-5414
Unnat Jain, Ziyu Zhang, Alexander G. Schwing:
Creativity: Generating Diverse Questions Using Variational Autoencoders. 5415-5424
Federico Monti, Davide Boscaini, Jonathan Masci, Emanuele Rodolà, Jan Svoboda, Michael M. Bronstein:
Geometric Deep Learning on Graphs and Manifolds Using Mixture Model CNNs. 5425-5434
George Toderici, Damien Vincent, Nick Johnston, Sung Jin Hwang, David Minnen, Joel Shor, Michele Covell:
Full Resolution Image Compression with Recurrent Neural Networks. 5435-5443
Zhixin Shu, Ersin Yumer, Sunil Hadap, Kalyan Sunkavalli, Eli Shechtman, Dimitris Samaras:
Neural Face Editing with Intrinsic Image Disentangling. 5444-5453
Iasonas Kokkinos:
UberNet: Training a Universal Convolutional Neural Network for Low-, Mid-, and High-Level Vision Using Diverse Datasets and Limited Memory. 5454-5463
James Booth, Epameinondas Antonakos, Stylianos Ploumpis, George Trigeorgis, Yannis Panagakis, Stefanos Zafeiriou:
3D Face Morphable Models "In-the-Wild". 5464-5473
Miroslava Slavcheva, Maximilian Baust, Daniel Cremers, Slobodan Ilic:
KillingFusion: Non-rigid 3D Reconstruction without Correspondences. 5474-5483
Chao Zhang, Sergi Pujades, Michael J. Black, Gerard Pons-Moll:
Detailed, Accurate, Human Shape Estimation from Clothed 3D Scan Sequences. 5484-5493
Guido Borghi, Marco Venturelli, Roberto Vezzani, Rita Cucchiara:
POSEidon: Face-from-Depth for Driver Pose Estimation. 5494-5503
Endri Dibra, Himanshu Jain, A. Cengiz Öztireli, Remo Ziegler, Markus H. Gross:
Human Shape from Silhouettes Using Generative HKS Descriptors and Cross-Modal Neural Networks. 5504-5514
Weilong Peng, Zhiyong Feng, Chao Xu, Yong Su:
Parametric T-Spline Face Morphable Model for Detailed Fitting in Shape Subspace. 5515-5523
Silvia Zuffi, Angjoo Kanazawa, David W. Jacobs, Michael J. Black:
3D Menagerie: Modeling the 3D Shape and Pose of Animals. 5524-5532
Sylvestre-Alvise Rebuffi, Alexander Kolesnikov, Georg Sperl, Christoph H. Lampert:
iCaRL: Incremental Classifier and Representation Learning. 5533-5542
Mude Lin, Liang Lin, Xiaodan Liang, Keze Wang, Hui Cheng:
Recurrent 3D Pose Sequence Machines. 5543-5552
Elad Richardson, Matan Sela, Roy Or-El, Ron Kimmel:
Learning Detailed Face Reconstruction from a Single Image. 5553-5562
Jie Song, Limin Wang, Luc Van Gool, Otmar Hilliges:
Thin-Slicing Network: A Deep Structured Model for Pose Estimation in Videos. 5563-5572
Federica Bogo, Javier Romero, Gerard Pons-Moll, Michael J. Black:
Dynamic FAUST: Registering Human Bodies in Motion. 5573-5582
Armin Mustafa, Adrian Hilton:
Semantically Coherent Co-Segmentation and Reconstruction of Dynamic Scenes. 5583-5592
Cenek Albl, Zuzana Kukelova, Andrew W. Fitzgibbon, Jan Heller, Matej Smíd, Tomás Pajdla:
On the Two-View Geometry of Unsynchronized Cameras. 5593-5602
Chen Kong, Chen-Hsuan Lin, Simon Lucey:
Using Locally Corresponding CAD Models for Dense 3D Reconstructions from a Single Image. 5603-5611
Jesus Briales, Javier Gonzalez-Jimenez:
Convex Global 3D Registration with Lagrangian Duality. 5612-5621
Benjamin Ummenhofer, Huizhong Zhou, Jonas Uhrig, Nikolaus Mayer, Eddy Ilg, Alexey Dosovitskiy, Thomas Brox:
DeMoN: Depth and Motion Network for Learning Monocular Stereo. 5622-5631
Arsalan Mousavian, Dragomir Anguelov, John Flynn, Jana Kosecka:
3D Bounding Box Estimation Using Deep Learning and Geometry. 5632-5640
Xun Sun, Yuanfan Xie, Pei Luo, Liang Wang:
A Dataset for Benchmarking Image-Based Localization. 5641-5649
Gunnar A. Sigurdsson, Santosh Kumar Divvala, Ali Farhadi, Abhinav Gupta:
Asynchronous Temporal Fields for Action Recognition. 5650-5659
Yao Li, Guosheng Lin, Bohan Zhuang, Lingqiao Liu, Chunhua Shen, Anton van den Hengel:
Sequential Person Recognition in Photo Albums with a Recurrent Network. 5660-5668
Xiao Chu, Wei Yang, Wanli Ouyang, Cheng Ma, Alan L. Yuille, Xiaogang Wang:
Multi-context Attention for Human Pose Estimation. 5669-5678
Liuhao Ge, Hui Liang, Junsong Yuan, Daniel Thalmann:
3D Convolutional Neural Networks for Efficient and Robust Hand Pose Estimation from Single Depth Images. 5679-5688
Denis Tomè, Chris Russell, Lourdes Agapito:
Lifting from the Deep: Convolutional 3D Pose Estimation from a Single Image. 5689-5698
Amlan Kar, Nishant Rai, Karan Sikka, Gaurav Sharma:
AdaScan: Adaptive Scan Pooling in Deep Convolutional Neural Networks for Human Action Recognition in Videos. 5699-5708
Robert Walecki, Ognjen Rudovic, Vladimir Pavlovic, Björn W. Schuller, Maja Pantic:
Deep Structured Learning for Facial Action Unit Intensity Estimation. 5709-5718
Basura Fernando, Hakan Bilen, Efstratios Gavves, Stephen Gould:
Self-Supervised Video Representation Learning with Odd-One-Out Networks. 5729-5738
Christos Sagonas, Yannis Panagakis, Alina Leidinger, Stefanos Zafeiriou:
Robust Joint and Individual Variance Explained. 5739-5748
Wen Wang, Ruiping Wang, Shiguang Shan, Xilin Chen:
Discriminative Covariance Oriented Representation Learning for Face Recognition with Image Sets. 5749-5758
Kazuma Sasaki, Satoshi Iizuka, Edgar Simo-Serra, Hiroshi Ishikawa:
Joint Gap Detection and Inpainting of Line Drawings. 5768-5776
Hyunwoo J. Kim, Nagesh Adluru, Heemanshu Suri, Baba C. Vemuri, Sterling C. Johnson, Vikas Singh:
Riemannian Nonlinear Mixed Effects Models: Analyzing Longitudinal Deformations in Neuroimaging. 5777-5786
Yawen Huang, Ling Shao, Alejandro F. Frangi:
Simultaneous Super-Resolution and Cross-Modality Synthesis of 3D Medical Images Using Weakly-Supervised Joint Convolutional Sparse Coding. 5787-5796
Aviad Levis, Yoav Y. Schechner, Anthony B. Davis:
Multiple-Scattering Microphysics Tomography. 5797-5806
Jia Xu, Rene Ranftl, Vladlen Koltun:
Accurate Optical Flow via Direct Cost Volume Processing. 5807-5815
Alex Zihao Zhu, Nikolay Atanasov, Kostas Daniilidis:
Event-Based Visual Inertial Odometry. 5816-5824
Le Zhang, Jagannadan Varadarajan, Ponnuthurai Nagaratnam Suganthan, Narendra Ahuja, Pierre Moulin:
Robust Visual Tracking Using Oblique Random Forests. 5825-5834
Wei-Sheng Lai, Jia-Bin Huang, Narendra Ahuja, Ming-Hsuan Yang:
Deep Laplacian Pyramid Networks for Fast and Accurate Super-Resolution. 5835-5843
Jian Shi, Yue Dong, Hao Su, Stella X. Yu:
Learning Non-Lambertian Object Intrinsics Across ShapeNet Categories. 5844-5853
Emilio J. Almazan, Ron Tal, Yiming Qian, James H. Elder:
MCMLSD: A Dynamic Programming Approach to Line Segment Detection. 5854-5862
Se-Ho Lee, Won-Dong Jang, Chang-Su Kim:
Contour-Constrained Superpixels for Image and Video Processing. 5863-5871
Yun Liu, Ming-Ming Cheng, Xiaowei Hu, Kai Wang, Xiang Bai:
Richer Convolutional Features for Edge Detection. 5872-5881
Stamatios Lefkimmiatis:
Non-local Color Image Denoising with Convolutional Neural Networks. 5882-5891
Yi Chang, Luxin Yan, Sheng Zhong:
Hyper-Laplacian Regularized Unidirectional Low-Rank Tensor Recovery for Multispectral Image Denoising. 5901-5909
Maggie Wigness, John G. Rogers III:
Unsupervised Semantic Scene Labeling for Streaming Data. 5910-5919
Rang Ho Man Nguyen, Michael S. Brown:
Why You Should Forget Luminance Conversion and Do Something Better. 5920-5928
Je Hyeong Hong, Christopher Zach, Andrew W. Fitzgibbon:
Revisiting the Variable Projection Method for Separable Nonlinear Least Squares Problems. 5939-5947
Marc Tewa Law, Yaoliang Yu, Raquel Urtasun, Richard S. Zemel, Eric P. Xing:
Efficient Multiple Instance Metric Learning Using Weakly Supervised Data. 5948-5956
Thibaut Durand, Taylor Mordan, Nicolas Thome, Matthieu Cord:
WILDCAT: Weakly Supervised Learning of Deep ConvNets for Image Classification, Pointwise Localization and Segmentation. 5957-5966
Phillip Isola, Jun-Yan Zhu, Tinghui Zhou, Alexei A. Efros:
Image-to-Image Translation with Conditional Adversarial Networks. 5967-5976
Yani Ioannou, Duncan P. Robertson, Roberto Cipolla, Antonio Criminisi:
Deep Roots: Improving CNN Efficiency with Hierarchical Filter Groups. 5977-5986
Saining Xie, Ross B. Girshick, Piotr Dollár, Zhuowen Tu, Kaiming He:
Aggregated Residual Transformations for Deep Neural Networks. 5987-5995
Hao Yang, Joey Tianyi Zhou, Jianfei Cai, Yew-Soon Ong:
MIML-FCN+: Multi-Instance Multi-Label Learning via Fully Convolutional Networks with Privileged Information. 5996-6004
Zhengming Ding, Ming Shao, Yun Fu:
Low-Rank Embedded Ensemble Semantic Dictionary for Zero-Shot Learning. 6005-6013
Zhiwei Deng, Rajitha Navarathna, Peter Carr, Stephan Mandt, Yisong Yue, Iain A. Matthews, Greg Mori:
Factorized Variational Autoencoders for Modeling Audience Reactions to Movies. 6014-6023
Deepak Pathak, Ross B. Girshick, Piotr Dollár, Trevor Darrell, Bharath Hariharan:
Learning Features by Watching Objects Move. 6024-6033
Rodrigo Santa Cruz, Basura Fernando, Anoop Cherian, Stephen Gould:
DeepPermNet: Visual Permutation Learning. 6044-6052
Mengjiao Wang, Yannis Panagakis, Patrick Snape, Stefanos Zafeiriou:
Learning the Multilinear Structure of Visual Data. 6053-6061
Lena Gorelick, Yuri Boykov, Olga Veksler:
Adaptive and Move Making Auxiliary Cuts for Binary Pairwise Energies. 6062-6070
Tien-Ju Yang, Yu-Hsin Chen, Vivienne Sze:
Designing Energy-Efficient Convolutional Neural Networks Using Energy-Aware Pruning. 6071-6079
Fangting Xia, Peng Wang, Xianjie Chen, Alan L. Yuille:
Joint Multi-person Pose Estimation and Semantic Part Segmentation. 6080-6089
Paul Upchurch, Jacob R. Gardner, Geoff Pleiss, Robert Pless, Noah Snavely, Kavita Bala, Kilian Q. Weinberger:
Deep Feature Interpolation for Image Content Changes. 6090-6099
Xiyang Dai, Joe Yue-Hei Ng, Larry S. Davis:
FASON: First and Second Order Information Fusion Network for Texture Recognition. 6100-6108
Steve Branson, Grant Van Horn, Pietro Perona:
Lean Crowdsourcing: Combining Humans and Machines in an Online System. 6109-6118
Youngjae Yu, Jongwook Choi, Yeonhwa Kim, Kyung Yoo, Sang-Hun Lee, Gunhee Kim:
Supervising Neural Attention Models for Video Captioning by Human Gaze Data. 6119-6127
Yurun Tian, Bin Fan, Fuchao Wu:
L2-Net: Deep Learning of Discriminative Patch Descriptor in Euclidean Space. 6128-6136
Gedas Bertasius, Lorenzo Torresani, Stella X. Yu, Jianbo Shi:
Convolutional Random Walk Networks for Semantic Image Segmentation. 6137-6145
Yuke Zhu, Joseph J. Lim, Li Fei-Fei:
Knowledge Acquisition for Visual Question Answering via Iterative Querying. 6146-6155
Bo Zhao, Jiashi Feng, Xiao Wu, Shuicheng Yan:
Memory-Augmented Attribute Manipulation Networks for Interactive Fashion Search. 6156-6164
Yang Long, Li Liu, Ling Shao, Fumin Shen, Guiguang Ding, Jungong Han:
From Zero-Shot Learning to Conventional Supervised Classification: Unseen Visual Data Synthesis. 6165-6174
Torsten Sattler, Akihiko Torii, Josef Sivic, Marc Pollefeys, Hajime Taira, Masatoshi Okutomi, Tomás Pajdla:
Are Large-Scale 3D Models Really Necessary for Accurate Visual Localization? 6175-6184
Giorgos Tolias, Ondrej Chum:
Asymmetric Feature Maps with Application to Sketch Based Retrieval. 6185-6193
Kan Chen, Trung Bui, Chen Fang, Zhaowen Wang, Ram Nevatia:
AMC: Attention Guided Multi-modal Correlation Learning for Image Search. 6203-6211
Peng Wang, Lingqiao Liu, Chunhua Shen, Zi Huang, Anton van den Hengel, Heng Tao Shen:
Multi-attention Network for One Shot Learning. 6212-6220
Weixiang Hong, Junsong Yuan, Sreyasee Das Bhattacharjee:
Fried Binary Embedding for High-Dimensional Visual Features. 6221-6229
Hengshuang Zhao, Jianping Shi, Xiaojuan Qi, Xiaogang Wang, Jiaya Jia:
Pyramid Scene Parsing Network. 6230-6239
Haoliang Sun, Xiantong Zhen, Yuanjie Zheng, Gongping Yang, Yilong Yin, Shuo Li:
Learning Deep Match Kernels for Image-Set Classification. 6240-6249
Xishan Zhang, Ke Gao, Yongdong Zhang, Dongming Zhang, Jintao Li, Qi Tian:
Task-Driven Dynamic Fusion: Reducing Ambiguity in Video Description. 6250-6258
Haomiao Liu, Ruiping Wang, Shiguang Shan, Xilin Chen:
Learning Multifunctional Binary Codes for Both Category and Attribute Oriented Retrieval Tasks. 6259-6268
Wei Zhuo, Mathieu Salzmann, Xuming He, Miaomiao Liu:
Indoor Scene Parsing with Instance Segmentation, Semantic Labeling and Support Relationship Inference. 6269-6277
Abrar H. Abdulnabi, Bing Shuai, Stefan Winkler, Gang Wang:
Episodic CAMN: Contextual Attention-Based Memory Networks with Iterative Feedback for Scene Labeling. 6278-6287
Mohamed Elhoseiny, Yizhe Zhu, Han Zhang, Ahmed M. Elgammal:
Link the Head to the "Beak": Zero Shot Learning from Noisy Text Description at Part Precision. 6288-6297
Long Chen, Hanwang Zhang, Jun Xiao, Liqiang Nie, Jian Shao, Wei Liu, Tat-Seng Chua:
SCA-CNN: Spatial and Channel-Wise Attention in Convolutional Networks for Image Captioning. 6298-6306

Yash Goyal, Tejas Khot, Douglas Summers-Stay, Dhruv Batra, Devi Parikh:
Making the V in VQA Matter: Elevating the Role of Image Understanding in Visual Question Answering. 6325-6334
Mark Yatskar, Vicente Ordonez, Luke Zettlemoyer, Ali Farhadi:
Commonly Uncommon: Semantic Sparsity in Situation Recognition. 6335-6344
Hong Liu, Rongrong Ji, Yongjian Wu, Feiyue Huang, Baochang Zhang:
Cross-Modality Binary Code Learning via Fusion Similarity Hashing. 6345-6353
Hamed R. Tavakoli, Fawad Ahmed, Ali Borji, Jorma Laaksonen:
Saliency Revisited: Analysis of Mouse Movements Versus Fixations. 6354-6362
Shay Zweig, Lior Wolf:
InterpoNet, a Brain Inspired Neural Network for Optical Flow Dense Interpolation. 6363-6372
Shyamal Buch, Victor Escorcia, Chuanqi Shen, Bernard Ghanem, Juan Carlos Niebles:
SST: Single-Stream Temporal Action Proposals. 6373-6382
Rui Yang, Bingbing Ni, Chao Ma, Yi Xu, Xiaokang Yang:
Video Segmentation via Multiple Granularity Analysis. 6383-6392
Seyed Morteza Safdarnejad, Xiaoming Liu:
Spatio-Temporal Alignment of Non-overlapping Sequences from Independently Panning Cameras. 6393-6401
Limin Wang, Yuanjun Xiong, Dahua Lin, Luc Van Gool:
UntrimmedNets for Weakly Supervised Action Recognition and Detection. 6402-6411
Nour Karessli, Zeynep Akata, Bernt Schiele, Andreas Bulling:
Gaze Embeddings for Zero-Shot Image Classification. 6412-6421
Siddha Ganju, Olga Russakovsky, Abhinav Gupta:
What's in a Question: Using Visual Questions as a Form of Supervision. 6422-6431
Cesc Chunseong Park, Byeongchang Kim, Gunhee Kim:
Attend to You: Personalized Image Captioning with Context Sequence Memory Networks. 6432-6440
V. S. R. Veeravasarapu, Constantin A. Rothkopf, Visvanathan Ramesh:
Adversarially Tuned Scene Generation. 6441-6449
Fei Wang, Mengqing Jiang, Chen Qian, Shuo Yang, Cheng Li, Honggang Zhang, Xiaogang Wang, Xiaoou Tang:
Residual Attention Network for Image Classification. 6450-6458
Xiaoxiao Li, Ziwei Liu, Ping Luo, Chen Change Loy, Xiaoou Tang:
Not All Pixels Are Equal: Difficulty-Aware Semantic Segmentation via Deep Layer Cascade. 6459-6468
Mohit Iyyer, Varun Manjunatha, Anupam Guha, Yogarshi Vyas, Jordan L. Boyd-Graber, Hal Daumé III, Larry S. Davis:
The Amazing Mysteries of the Gutter: Drawing Inferences Between Panels in Comic Book Narratives. 6478-6487
Yunchao Wei, Jiashi Feng, Xiaodan Liang, Ming-Ming Cheng, Yao Zhao, Shuicheng Yan:
Object Region Mining with Adversarial Erasing: A Simple Classification to Semantic Segmentation Approach. 6488-6496
Michael Lam, Behrooz Mahasseni, Sinisa Todorovic:
Fine-Grained Recognition as HSnet Search for Informative Image Parts. 6497-6506
Qilong Wang, Peihua Li, Lei Zhang:
G2DeNet: Global Gaussian Distribution Embedding Network and Its Application to Visual Recognition. 6507-6516
Xiaozhi Chen, Huimin Ma, Ji Wan, Bo Li, Tian Xia:
Multi-view 3D Object Detection Network for Autonomous Driving. 6526-6534
Sean Ryan Fanello, Julien P. C. Valentin, Christoph Rhemann, Adarsh Kowdle, Vladimir Tankovich, Philip L. Davidson, Shahram Izadi:
UltraStereo: Efficient Learning-Based Matching for Active Stereo Systems. 6535-6544
Angela Dai, Charles Ruizhongtai Qi, Matthias Nießner:
Shape Completion Using 3D-Encoder-Predictor CNNs and Shape Synthesis. 6545-6554
Alex Kendall, Roberto Cipolla:
Geometric Loss Functions for Camera Pose Regression with Deep Learning. 6555-6564
Keisuke Tateno, Federico Tombari, Iro Laina, Nassir Navab:
CNN-SLAM: Real-Time Dense Monocular SLAM with Learned Depth Prediction. 6565-6574
Andreas Veit, Neil Alldrin, Gal Chechik, Ivan Krasin, Abhinav Gupta, Serge J. Belongie:
Learning from Noisy Large-Scale Datasets with Minimal Supervision. 6575-6583
Li Yi, Hao Su, Xingwen Guo, Leonidas J. Guibas:
SyncSpecCNN: Synchronized Spectral CNN for 3D Shape Segmentation. 6584-6592
Zhiming Luo, Akshaya Kumar Mishra, Andrew Achkar, Justin A. Eichel, Shaozi Li, Pierre-Marc Jodoin:
Non-local Deep Features for Salient Object Detection. 6593-6601
Clément Godard, Oisin Mac Aodha, Gabriel J. Brostow:
Unsupervised Monocular Depth Estimation with Left-Right Consistency. 6602-6611
Tinghui Zhou, Matthew Brown, Noah Snavely, David G. Lowe:
Unsupervised Learning of Depth and Ego-Motion from Video. 6612-6619
Gernot Riegler, Ali Osman Ulusoy, Andreas Geiger:
OctNet: Learning Deep 3D Representations at High Resolutions. 6620-6629
Evangelos Kalogerakis, Melinos Averkiou, Subhransu Maji, Siddhartha Chaudhuri:
3D Shape Segmentation with Projective Convolutional Networks. 6630-6639
Yiming Qian, Minglun Gong, Yee-Hong Yang:
Stereo-Based 3D Reconstruction of Dynamic Fluid Surfaces by Global Optimization. 6650-6659
Oliver Zendel, Katrin Honauer, Markus Murschitz, Martin Humenberger, Gustavo Fernández Domínguez:
Analyzing Computer Vision Data - The Good, the Bad and the Ugly. 6670-6680
Matthias Vestner, Roee Litman, Emanuele Rodolà, Alexander M. Bronstein, Daniel Cremers:
Product Manifold Filter: Non-rigid Shape Correspondence via Kernel Density Estimation in the Product Space. 6681-6690
Michel Antunes, Joao P. Barreto, Djamila Aouada, Björn Ottersten:
Unsupervised Vanishing Point Detection and Camera Calibration from a Single Manhattan Image with Radial Distortion. 6691-6699
Federico Camposeco, Torsten Sattler, Andrea Cohen, Andreas Geiger, Marc Pollefeys:
Toroidal Constraints for Two-Point Localization Under High Outlier Ratios. 6700-6708
Yuan Gao, Alan L. Yuille:
Exploiting Symmetry and/or Manhattan Properties for 3D Object Structure Estimation from Single and Multiple Images. 6718-6727
Jie Qin, Li Liu, Ling Shao, Bingbing Ni, Chen Chen, Fumin Shen, Yunhong Wang:
Binary Coding for Partial Action Analysis with Limited Observation Ratios. 6728-6737
Weiyang Liu, Yandong Wen, Zhiding Yu, Ming Li, Bhiksha Raj, Le Song:
SphereFace: Deep Hypersphere Embedding for Face Recognition. 6738-6746
Hugo Proença, João C. Neves:
IRINA: Iris Recognition (Even) in Inaccurately Segmented Data. 6747-6756
Ke Gong, Xiaodan Liang, Dongyu Zhang, Xiaohui Shen, Liang Lin:
Look into Person: Self-Supervised Structure-Sensitive Learning and a New Benchmark for Human Parsing. 6757-6765
Wei Li, Farnaz Abtahi, Zhigang Zhu:
Action Unit Detection with Region Adaptation, Multi-labeling Learning and Optimal Temporal Fusing. 6766-6775
Zhen Zhou, Yan Huang, Wei Wang, Liang Wang, Tieniu Tan:
See the Forest for the Trees: Joint Spatial and Temporal Recurrent Neural Networks for Video-Based Person Re-identification. 6776-6785
Yasushi Makihara, Atsuyuki Suzuki, Daigo Muramatsu, Xiang Li, Yasushi Yagi:
Joint Intensity and Spatial Metric Learning for Robust Gait Recognition. 6786-6796
Vijay Kumar, Anoop M. Namboodiri, Manohar Paluri, C. V. Jawahar:
Pose-Aware Person Recognition. 6797-6806
José Lezama, Qiang Qiu, Guillermo Sapiro:
Not Afraid of the Dark: NIR-VIS Face Recognition via Cross-Spectral Hallucination and Low-Rank Embedding. 6807-6816
Katsuyuki Nakamura, Serena Yeung, Alexandre Alahi, Li Fei-Fei:
Jointly Learning Energy Expenditures and Activities Using Egocentric Multimodal Signals. 6817-6826
Wei Zhang, Xiaochun Cao, Rui Wang, Yuanfang Guo, Zhineng Chen:
Binarized Mode Seeking for Scalable Visual Pattern Discovery. 6827-6835
Patsorn Sangkloy, Jingwan Lu, Chen Fang, Fisher Yu, James Hays:
Scribbler: Controlling Deep Image Synthesis with Sketch and Color. 6836-6845
Lifang He, Chun-Ta Lu, Hao Ding, Shen Wang, Linlin Shen, Philip S. Yu, Ann B. Ragin:
Multi-way Multi-level Kernel Modeling for Neuroimaging Classification. 6846-6854
Xinliang Zhu, Jiawen Yao, Feiyun Zhu, Junzhou Huang:
WSISA: Making Survival Prediction from Whole Slide Histopathological Images. 6855-6863
Tali Dekel, Michael Rubinstein, Ce Liu, William T. Freeman:
On the Effectiveness of Visible Watermarks. 6864-6872
Zhiwei Xiong, Lizhi Wang, Huiqun Li, Dong Liu, Feng Wu:
Snapshot Hyperspectral Light Field Imaging. 6873-6881
Raymond A. Yeh, Chen Chen, Teck-Yian Lim, Alexander G. Schwing, Mark Hasegawa-Johnson, Minh N. Do:
Semantic Image Inpainting with Deep Generative Models. 6882-6890
Tatsunori Taniai, Sudipta N. Sinha, Yoichi Sato:
Fast Multi-frame Stereo Scene Flow with Motion Segmentation. 6891-6900
Amit Shaked, Lior Wolf:
Improved Stereo Matching with Constant Highway Networks and Reflective Confidence Learning. 6901-6910
Tal Schuster, Lior Wolf, David Gadot:
Optical Flow Requires Multiple Strategies (but Only One Network). 6921-6930
Martin Danelljan, Goutam Bhat, Fahad Shahbaz Khan, Michael Felsberg:
ECO: Efficient Convolution Operators for Tracking. 6931-6939
Jia Xue, Hang Zhang, Kristin J. Dana, Ko Nishino:
Differential Angular Imaging for Material Recognition. 6940-6949
Johannes L. Schönberger, Hans Hardmeier, Torsten Sattler, Marc Pollefeys:
Comparative Evaluation of Hand-Crafted and Learned Local Features. 6959-6968
Jiawei Zhang, Jin-shan Pan, Wei-Sheng Lai, Rynson W. H. Lau, Ming-Hsuan Yang:
Learning Fully Convolutional Networks for Iterative Non-blind Deconvolution. 6969-6977
Yanyang Yan, Wenqi Ren, Yuanfang Guo, Rui Wang, Xiaochun Cao:
Image Deblurring via Extreme Channels Prior. 6978-6986
Liyuan Pan, Yuchao Dai, Miaomiao Liu, Fatih Porikli:
Simultaneous Stereo Video Deblurring and Scene Flow Estimation. 6987-6996
Takuhiro Kaneko, Kaoru Hiramatsu, Kunio Kashino:
Generative Attribute Controller with Conditional Filtered Generative Adversarial Networks. 7006-7015
Jing Zhang, Yang Cao, Shuai Fang, Yu Kang, Chang Wen Chen:
Fast Haze Removal for Nighttime Image Using Maximum Reflectance Prior. 7016-7024

Haozhi Huang, Hao Wang, Wenhan Luo, Lin Ma, Wenhao Jiang, Xiaolong Zhu, Zhifeng Li, Wei Liu:
Real-Time Neural Style Transfer for Videos. 7044-7052
Shijie Yang, Liang Li, Shuhui Wang, Weigang Zhang, Qingming Huang:
A Graph Regularized Deep Neural Network for Unsupervised Image Representation Learning. 7053-7061
Paul Swoboda, Carsten Rother, Hassan Abu Alhaija, Dagmar Kainmüller, Bogdan Savchynskyy:
A Study of Lagrangean Decompositions and Dual Ascent Solvers for Graph Matching. 7062-7071
Xiangyu Kong, Bo Xin, Yizhou Wang, Gang Hua:
Collaborative Deep Reinforcement Learning for Joint Object Search. 7072-7081
Samuel Rota Bulò, Gerhard Neuhold, Peter Kontschieder:
Loss Max-Pooling for Semantic Image Segmentation. 7082-7091
Zelun Luo, Boya Peng, De-An Huang, Alexandre Alahi, Li Fei-Fei:
Unsupervised Learning of Long-Term Motion Dynamics for Videos. 7101-7110
Luping Zhou, Lei Wang, Jianjia Zhang, Yinghuan Shi, Yang Gao:
Revisiting Metric Learning for SPD Matrix Based Visual Representation. 7111-7119
Rahaf Aljundi, Punarjay Chakravarty, Tinne Tuytelaars:
Expert Gate: Lifelong Learning with a Network of Experts. 7120-7129
Junho Yim, Donggyu Joo, Jihoon Bae, Junmo Kim:
A Gift from Knowledge Distillation: Fast Optimization, Network Minimization and Transfer Learning. 7130-7138
Piotr Koniusz, Yusuf Tas, Fatih Porikli:
Domain Adaptation by Mixture of Alignments of Second-or Higher-Order Scatter Tensors. 7139-7148
Stéphane Lathuilière, Remi Juge, Pablo Mesejo, Rafael Muñoz-Salinas, Radu Horaud:
Deep Mixture of Linear Inverse Regressions Applied to Head-Pose Estimation. 7149-7157
Yang He, Wei-Chen Chiu, Margret Keuper, Mario Fritz:
STD2P: RGBD Semantic Segmentation Using Spatio-Temporal Data-Driven Pooling. 7158-7167
Daniel E. Worrall, Stephan J. Garbin, Daniyar Turmukhambetov, Gabriel J. Brostow:
Harmonic Networks: Deep Translation and Rotation Equivariance. 7168-7177
Xin Wang, Geoffrey Oxholm, Da Zhang, Yuan-Fang Wang:
Multimodal Transfer: A Hierarchical Deep Convolutional Neural Network for Fast Artistic Style Transfer. 7178-7186
Spyros Gidaris, Nikos Komodakis:
Detect, Replace, Refine: Deep Structured Prediction for Pixel Wise Labeling. 7187-7196
Eunhyeok Park, Junwhan Ahn, Sungjoo Yoo:
Weighted-Entropy-Based Quantization for Deep Neural Networks. 7197-7205
Daiki Ikami, Toshihiko Yamasaki, Kiyoharu Aizawa:
Residual Expansion Algorithm: Fast and Effective Optimization for Nonconvex Least Squares Problems. 7206-7214
Qing Sun, Stefan Lee, Dhruv Batra:
Bidirectional Beam Search: Forward-Backward Inference in Neural Sequence Models for Fill-in-the-Blank Image Captioning. 7215-7223
Hariprasad Kannan, Nikos Komodakis, Nikos Paragios:
Newton-Type Methods for Inference in Higher-Order Markov Random Fields. 7224-7233
Zheng Xu, Mário A. T. Figueiredo, Xiaoming Yuan, Christoph Studer, Tom Goldstein:
Adaptive Relaxed ADMM: Convergence Theory and Practical Implementation. 7234-7243
Yikang Li, Wanli Ouyang, Xiaogang Wang, Xiaoou Tang:
ViP-CNN: Visual Phrase Guided Convolutional Neural Network. 7244-7253
Yan Huang, Wei Wang, Liang Wang:
Instance-Aware Image and Sentence Matching with Selective Multimodal LSTM. 7254-7262
Rafael S. Rezende, Joaquin Zepeda, Jean Ponce, Francis R. Bach, Patrick Pérez:
Kernel Square-Loss Exemplar Machines for Image Retrieval. 7263-7271
Saurabh Gupta, James Davidson, Sergey Levine, Rahul Sukthankar, Jitendra Malik:
Cognitive Mapping and Planning for Visual Navigation. 7272-7281
Anirban Roy, Sinisa Todorovic:
Combining Bottom-Up, Top-Down, and Smoothness Cues for Weakly Supervised Image Segmentation. 7282-7291
Ya-Fang Shih, Yang-Ming Yeh, Yen-Yu Lin, Ming-Fang Weng, Yi-Chang Lu, Yung-Yu Chuang:
Deep Co-occurrence Feature Learning for Visual Object Recognition. 7302-7311
Santhosh K. Ramakrishnan, Ambar Pal, Gaurav Sharma, Anurag Mittal:
An Empirical Evaluation of Visual Question Answering for Novel Objects. 7312-7321
Alexander Kirillov, Evgeny Levinkov, Bjoern Andres, Bogdan Savchynskyy, Carsten Rother:
InstanceCut: From Edges to Instances with MultiCut. 7322-7331
Xiangteng He, Yuxin Peng:
Fine-Grained Image Classification via Combining Vision and Language. 7332-7340
Quanquan Li, Shengying Jin, Junjie Yan:
Mimicking Very Efficient Network for Object Detection. 7341-7349
Zhenyang Li, Ran Tao, Efstratios Gavves, Cees G. M. Snoek, Arnold W. M. Smeulders:
Tracking by Natural Language Specification. 7350-7358
Tegan Maharaj, Nicolas Ballas, Anna Rohrbach, Aaron C. Courville, Christopher Joseph Pal:
A Dataset and Exploration of Models for Understanding Video Data through Fill-in-the-Blank Question-Answering. 7359-7368
Yufei Wang, Zhe Lin, Xiaohui Shen, Scott Cohen, Garrison W. Cottrell:
Skeleton Key: Image Captioning by Skeleton-Attribute Decomposition. 7378-7387
Arnon Amir, Brian Taba, David J. Berg, Timothy Melano, Jeffrey L. McKinstry, Carmelo di Nolfo, Tapan K. Nayak, Alexander Andreopoulos, Guillaume Garreau, Marcela Mendoza, Jeff Kusnitz, Michael DeBole, Steven K. Esser, Tobi Delbrück, Myron Flickner, Dharmendra S. Modha:
A Low Power, Fully Event-Based Gesture Recognition System. 7388-7397
Dangwei Li, Xiaotang Chen, Zhang Zhang, Kaiqi Huang:
Learning Deep Context-Aware Features over Body and Latent Parts for Person Re-identification. 7398-7407
Minsi Wang, Bingbing Ni, Xiaokang Yang:
Recurrent Modeling of Interaction Context for Collective Activity Recognition. 7408-7416
Yeong Jun Koh, Chang-Su Kim:
Primary Object Segmentation in Videos Based on Region Augmentation and Reduction. 7417-7425
Ondrej Miksik, Juan-Manuel Perez-Rua, Philip H. S. Torr, Patrick Pérez:
ROAM: A Rich Object Appearance Model with Application to Rotoscoping. 7426-7434
Christoph Feichtenhofer, Axel Pinz, Richard P. Wildes:
Temporal Residual Networks for Dynamic Scene Recognition. 7435-7444
Christoph Feichtenhofer, Axel Pinz, Richard P. Wildes:
Spatiotemporal Multiplier Networks for Video Action Recognition. 7445-7454
Serena Yeung, Vignesh Ramanathan, Olga Russakovsky, Liyue Shen, Greg Mori, Li Fei-Fei:
Learning to Learn from Noisy Web Videos. 7455-7463
Esteban Real, Jonathon Shlens, Stefano Mazzocchi, Xin Pan, Vincent Vanhoucke:
YouTube-BoundingBoxes: A Large High-Precision Human-Annotated Data Set for Object Detection in Video. 7464-7473
Won-Dong Jang, Chang-Su Kim:
Online Video Object Segmentation via Convolutional Trident Network. 7474-7483



Google
Google Scholar
MS Academic
CiteSeerX
CORE
Semantic Scholar
